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ized to  reveal  a large  amount  of  parallelism,  which  can  be  exploited 
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Chapter  1 
INTRODUCTION 

Computers  now  exist  which  are  capable  of  performing  several 
independent  operations  simultaneously  [3,7,10,24,25,31].  This  fact  is 
the  motivation  for  much  recent  research  into  problem  solving  methods 
which  take  advantage  of  this  parallelism  to  minimize  computation  time. 
Such  methods  have  been  developed  not  only  for  problems  like  vector  ad- 
dition, which  is  inherently  parallel,  but  also  for  problems  such  as 
solving  a system  of  linear  equations  [8],  evaluating  an  arithmetic 
expression  [16],  and  sorting  an  array  [18].  Most  recently,  graph 
problems  have  been  considered  and  algorithms  for  finding  the  connected 
components  of  a graph  have  been  developed  [2,11].  This  paper  is  a 
study  of  parallelism  in  other  graph  theoretic  problems.  The  problems 
considered  include  finding,  for  a given  graph,  a minimum  spanning  tree, 
a cycle  basis,  the  biconnected  components,  the  bridges,  and  the  domin- 
ators.  For  each  problem,  a parallel  algorithm  is  developed  and  ana- 
lyzed. In  addition,  theoretical  lower  bounds  on  the  time  required  to 
solve  the  problems  are  found.  The  model  of  computation  which  is  used 
Is  the  unbounded  parallel  model  discussed  in  Section  1.1. 

1 .1  Models  of  Computation 

A meaningful  study  of  the  development  and  analysis  of  algo- 
rithms in  which  several  operations  may  be  performed  simultaneously  re- 
quires a precise  model  of  computation.  Existing  parallel  machines 


differ  widely  in  their  characteristics  and  the  study  of  their  operation 
is  complicated  by  hardware  considerations.  Although  it  is  of  practical 
interest  to  develop  algorithms  to  solve  problems  on  a particular  com- 
puter, the  structure  and  performance  of  these  algorithms  will  depend 
not  only  on  the  problem  at  hand,  but  on  the  advantages  and  limitations 
of  the  computer  as  well.  A more  fundamental  approach  to  parallel  com- 
puting is  to  study  the  structure  of  problem  solutions  and  seek  ways  to 
reorganize  solutions  so  that  operations  can  be  performed  simultaneously 
to  reduce  the  computation  time.  Two  commonly  used  models  of  computa- 
tion, which  are  suitable  for  this  approach,  are  discussed  below. 

Let  B be  a set  of  binary  operations,  which  include  binary 
comparisons.  The  k-parallel  model  consists  of  k identical,  indepen- 
dently controlled  processors,  each  of  which  is  capable  of  performing 
operations  in  B.  Each  processor  has  access  to  an  arbitrarily  large  com- 
mon memory.  The  k-parallel  model  operates  synchronously,  in  a sequence 
of  steps,  called  time  units.  Initially,  the  input  is  assumed  to  be 
stored  in  the  memory.  During  any  time  unit,  each  processor  may  take 
its  operands  from  the  memory,  perform  an  operation  in  B,  and  store  the 
result  of  the  operation  in  the  memory.  During  no  time  unit  may  two  pro- 
cessors store  a result  in  the  same  memory  location,  although  they  may 
read  the  content  of  the  same  memory  location. 

The  unbounded  para  He1  model  is  the  same  as  the  k-parallel 
model,  except  that  the  number  of  processors  is  unbounded.  The  unbounded 
parallel  model  of  computation  is  the  one  which  will  be  used  in  this 
paper.  Notice  that  this  model  is  sufficiently  general  to  handle  adaptive 


operations.  For  example,  let  b be  a function  of  n variables,  Xp 
. . . , xn,  where  b(x1 , . . . , xn)  e {0,1}.  Assume  that  b can  be  com- 
puted on  t'se  unbounded  parallel  model,  given  (x^  . . .,  xn).  Let 
g(yr  . . . , ym)  and  h(z-|,  . . . , zk)  be  arbitrary  functions  which  can 
be  computed  on  the  unbounded  parallel  model,  given  (y^ , . . .,  ym) 

and  (z, , . . . , zk),  respectively.  For  given  values  of 

xl’  • * *’  V yl ym*  Z1  ’ * • *’  zk’ 

let  f be  a function  which  has  the  value  g(y-j,  . . .,  ym)  if 

b(x-j , . . .,  xn)  = 0 and  h(z-j,  . . . , zk)  if  b(x^ , . . .,xn)  = 1.  Then 

f(xr  . . .,  xn,  y-, , . . ym,  zr  . . .,  zk) 

= (l-b(x-|,  . . .,  x^))  g(y-j,  . • •»  y^) 

+ b(x-j , . . .,  xn)h(zr  . . .,  zj. 

Thus,  f can  be  computed  on  the  unbounded  parallel  model,  under  the  as- 
sumption that  the  set  B contains  the  operations  of  addition,  subtrac- 
tion, and  multiplication. 

1.2  Complexity  Measures 

In  order  to  evaluate  the  performance  of  an  algorithm  devel- 
oped to  solve  a problem  on  a particular  model  of  computation,  parameter 
are  needed  to  measure  its  efficiency.  On  the  two  models  above,  mean- 
ingful measures  of  the  complexity  of  an  algorithm  are  the  number  of 
time  units  used  and  the  amount  of  storage  required  to  perform  the 


algorithm.  Another  measure,  on  the  unbounded  parallel  model,  is  the 
maximum  number  of  processors  actually  used  to  execute  a given 
algorithm.  The  time,  storage,  and  number  of  processors  used  by  an 
algorithm  which  solves  a certain  class  of  problems  may  be  less  for 
"small"  problems  in  the  class  than  for  "large"  ones.  It  is  thus  im- 
portant to  have  a measure  of  the  "size"  of  a problem  and  to  determine 
the  time,  storage,  and  number  of  processors  used  by  an  algorithm  to 
solve  a given  problem  as  a function  of  the  size  of  the  problem. 

Let  C be  a class  of  problems  for  which  some  appropriate  mea- 
sure of  size  has  been  determined.  (For  example,  the  measure  of  the 
size  of  a problem  for  a graph  could  be  the  number  of  vertices  of  the 
graph.  The  size  of  a polynomial  evaluation  problem  could  be  the  de- 
gree of  the  polynomial.)  For  p e C,  let  |p|  be  the  size  of  p.  Let  A 
be  an  algorithm  which  solves  problems  in  C on  a k-parallel  or  unbounded 
parallel  model.  For  p e C,  let  t(p)  be  the  number  of  time  units,  s(p), 
the  number  of  memory  locations,  and  q(p),  the  number  of  processors  used 
to  solve  problem  p using  algorithm  A.  Then  the  time  complexity  of  the 
algorithm  A is  a function  T^,  defined  on  the  set  X = (m  | m is  a non- 
negative integer  and  for  some  p e C,  |p|  = m},  where  for  n e X, 

T^(n)  = max  (t(p)  | p e C and  |p|  = n}. 

The  storage  complexity  of  A is  the  function  S^,  where 

S^(n)  = max  {s ( p)  | p e C and  |p|  = n} 


and  the  processor  complexity  of  A is  the  function  QA,  where 


5 


QA ( n ) ■ max  {q(p)  | p c C and  | p J ■ n). 

The  time,  storage,  and  processor  complexities  provide  ways  to 
evaluate  and  compare  algorithms  developed  for  Implementation  on  the  k- 
parallel  or  unbounded  parallel  model.  It  Is  also  of  Interest  to  compare 
the  complexity  of  a given  parallel  algorithm  to  the  complexity  of  a 
sequential  algorithm  which  solves  the  same  problem.  Sequential  algorithms 
have  been  studied  extensively  and  the  most  commonly  used  model  of  compu- 
tation is  a random  access  machine  (RAM)  [1],  A RAM  Is  similar  to  a 
l-parallel  processor  where  Input  Is  read  from  an  Input  tape,  rather 
than  from  the  memory.  Any  algorithm  designed  to  be  executed  on  a RAM 
In  time  T can  be  modified  to  work  on  the  1-parallel  model  In  time  at 
most  T. 

In  developing  efficient  algorithms  on  a k-parallel  model,  the 
primary  goal  Is  to  find,  for  a given  problem,  an  algorithm  of  minimum 
time  complexity,  using  the  k processors  available.  Let  A be  a k-parallel 
algorithm  with  time  complexity  T(n).  During  execution  of  A,  at  most  k 
binary  operations  are  performed  In  any  time  unit.  Modifying  A so  that 
these  operations  are  performed  sequentially,  we  obtain  a sequential 

I 

algorithm,  A , which  solves  the  same  problem  as  A and  has  time  complex- 
ity at  most  k-T(n) . Thus,  if  the  best  serial  algorithm  for  a given 
problem  has  time  complexity  f ( n) , the  goal  Is  to  find  a k-parallel  al- 
gorithm of  time  complexity  which  Is  bounded  above  by  c»f(n)/k,  for 
some  constant  c.  When  a "fast"  algorithm  Is  found,  It  Is  desirable  to 
Increase  Its  efficiency  by  minimizing  the  storage  complexity. 


6 


When  an  unbounded  parallel  model  is  used  as  the  model  of  com- 
putation, again  the  goal  is  to  find,  for  a given  problem,  an  algorithm 
of  minimal  time  complexity,  regardless  of  how  many  processors  must  be 
used.  When  a "fast"  algorithm  has  been  found,  it  Is  of  practical  in- 
terest to  try  to  minimize  the  number  of  processors  used  without  in- 
creasing the  time  complexity  by  more  than  a constant  factor. 

Before  we  mention  some  results  which  have  been  obtained  using 
the  unbounded  parallel  model  of  computation,  the  "0"  notation  should  be 
defined.  Let  f and  g be  nonnegative  real  valued  functions  defined  on 
the  set  of  nonnegative  Integers.  Then  g(n)  = 0(f(n)),  read  "g(n)  is  of 
order  f(n),"  if,  for  some  constant  c,  g(n)  < cf(n)  for  all  but  finitely 
many  values  of  n [1].  Expressions  like  "algorithm  A takes  time  0(f(n))“ 
will  be  used  throughout  this  paper  to  mean  "the  time  complexity  of  al- 
gorithm A is  0(f(n)) ." 

1 .3  Previous  Results 

To  show  the  speedup  in  computation  time  which  can  result  by 
solving  a problem  in  parallel,  and  to  see  the  number  of  processors  needed 
to  achieve  such  a speedup,  some  results  which  have  been  obtained  are 
displayed  In  Table  1.  For  each  problem  listed,  the  table  gives  the  time 
and  processor  complexities  of  the  best  known  parallel  algorithm  which 
solves  the  problem  on  the  unbounded  parallel  model.  For  comparison, 
the  time  complexity  of  the  fastest  known  sequential  algorithm  for  each 
problem  is  given. 
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1 .4  Outline  of  Thesis 

In  this  paper,  parallel  algorithms  are  presented  for  several 
graph  problems,  based  on  the  unbounded  parallel  model  of  computation 
presented  in  Section  1.1.  Although  parallelism  in  graph  problems  has 
only  recently  been  considered,  sequential  algorithms  for  graph  prob- 
lems have  been  studied  extensively  and  efficient  algorithms  exist  for 
many  problems,  including  finding  the  biconnected  components  of  a graph 
[1],  finding  a minimum  spanning  tree  of  a graph  [32],  and  deciding 
whether  a graph  is  planar  [14].  Unfortunately,  one  of  the  major  tools 
used  in  efficient  sequential  algorithms,  the  depth  first  search  of 
Tarjan  [28],  cannot  be  adapted  to  yield  efficient  algorithms  on  a model 
with  unbounded  parallelism.  On  the  other  hand,  some  inefficient  serial 
algorithms  can  be  transformed  into  fast  parallel  algorithms.  The  al- 
gorithms presented  in  this  paper  have  been  developed  by  combining  se- 
quential techniques  with  new  techniques,  designed  to  exploit  parallel- 
ism. 

The  graph  problems  which  will  be  considered  here  are  the 
following.  For  a given  connected,  undirected  graph  G,  find  a spanning 
tree  of  G,  a cycle  of  G,  a cycle  basis  for  G,  the  bridges  and  bridge 
connected  components  of  G,  and  the  biconnected  components  of  G.  For  a 
given  connected,  undirected,  weighted  graph  G,  find  a minimum  spanning 
tree  of  G.  For  a given  connected,  directed  graph  G,  find  a cycle  of 
G,  a shortest  cycle  of  G,  and  the  dominators  and  dominator  tree  of  G. 
Using  the  adjacency  matrix  of  the  graph  as  input  (or  the  weight  matrix 
in  the  case  of  a weighted  graph),  we  develop  parallel  algorithms  of 
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time  complexity  0(log  >3)  to  solve  each  of  these  problems,  where  n is 
the  number  of  vertices  of\the  graph.  It  is  shown  that  for  each  al- 
gorithm, there  is  a polynomial  in  n which  is  an  upper  bound  on  the 
number  of  processors  required  to  execute  the  algorithm  on  a graph  with 
n vertices. 

Lower  bounds  for  graph  problems  are  discussed  in  Chapter  8. 

It  is  shown  for  many  problems  that  any  algorithm  which  solves  the  prob- 
lem on  the  unbounded  parallel  model,  using  the  adjacency  or  weight 
matrix  as  input,  requires  time  at  least  2 log  n + c,  for  some  con- 
stant c. 

It  is  of  interest  to  compare  the  time  and  processor  complex- 
ities of  the  parallel  algorithms  presented  here  with  the  time  complex- 
ity of  the  corresponding  sequential  algorithm.  Let  T(n)  and  P(n)  be 
the  time  and  processor  complexities,  respectively,  of  a parallel  al- 
gorithm, A,  for  a certain  problem.  Let  T^(n)  be  the  time  complexity 

I 

of  the  fastest  sequential  algorithm,  A , which  solves  the  same  problem. 
The  ratio  S(n)  = T-|(n)/T(n),  called  the  speedup,  is  a measure  of  the 
decrease  in  computation  time  which  results  by  using  algorithm  A in- 

I 

stead  of  algorithm  A . To  compare  the  overall  efficiency  of  algorithms 

I I 

A and  A , note  that  A performs  at  most  T-|(n)  operations,  while  A per- 
forms  at  most  T(n)  • P(n)  operations.  The  ratio  E(n)  = S(n)/P(n), 
called  the  efficiency,  is  a comparison  of  the  total  work  done  by  both 
algorithms.  A small  function  E(n)  (for  example,  E(n)  5 1/n  ) indicates 
that  a large  amount  of  waste  is  incurred  by  using  P(n)  processors  to 
solve  the  given  problem. 
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Each  graph  problem  considered  here  can  be  solved  by  a sequen- 
tial algorithm  with  time  complexity  bounded  above  by  a polynomial 
function  of  the  number  of  vertices  of  the  graph.  It  has  been  shown 
for  sequential  algorithms  that  different  representations  of  a problem 
can  yield  algorithms  of  different  time  complexities.  (For  example, 

it  can  be  shown  that  if  a graph  G,  with  n vertices,  is  represented  by 

2 

its  adjacency  matrix,  at  least  cn  time  units  are  required  to  deter- 
mine whether  or  not  G is  planar  [22].  On  the  other  hand,  Hopcroft  and 
Tarjan  [14]  have  developed  an  algorithm  which  determines  planarity  in 
time  0(n)  using  an  adjacency  list  representation  [1]  of  G.)  Since  all 
algorithms  presented  here  use  an  adjacency  or  weight  matrix  representa- 
tion of  a graph,  it  is  most  appropriate  to  compare  them  to  sequential 
algorithms  which  use  the  same  representation.  But  it  is  also  of  general 
interest  to  compare  the  parallel  algorithms  to  the  fastest  known  serial 
algorithms.  Thus,  whenever  possible,  the  time  complexities  of  the 
fastest  known  sequential  algorithms  in  both  cases  will  be  given,  for 
comparison  with  the  corresponding  parallel  algorithm. 

Definitions,  results,  and  basic  algorithms  which  are  used  in 
this  paper  are  discussed  in  Chapter  2. 


Chapter  2 


DEFINITIONS  AND  BASIC  ALGORITHMS 

2.1  Definitions 

An  undirected  graph,  G = (V, E),  consists  of  a finite,  non- 
empty set  V of  vertices  and  a set  E of  unordered  pairs,  (x,y),  of 
elements  of  V,  called  edges.  If  (x,y)  e E,  x and  y are  adjacent  and 
(x,y)  is  incident  with  x and  y.  G is  simple  if  (x,x)  t E for  all  x e V 
It  will  be  assumed  that  all  undirected  graphs  in  this  paper  are  simple. 
A graph  H = (V^.E^)  is  a subgraph  of  G,  written  H *=  G,  if  VH  * V and 
Eh  6 E.  A path  in  G is  a subgraph  P = (Vp,Ep)  of  G where 
Vp  = {x-,,  . . .,  xk>  and  Ep  * {(x^,xi+1)  | i = 1 , . . . , k - 1 } . The 

path  P joins  x-|  and  xk  and  the  length  of  P is  k - 1.  We  will  sometimes 

represent  P as  a sequence  x^,  . . .,  xk.  The  path  P is  simple  if 
x.  f Xj  for  i,j  e {1,  . . .,  k};  P is  a cycle  if  x^  = xk,  k > 4,  and 
x^,  . . .,  xk_]  is  a simple  path.  The  union  of  two  graphs,  G = (V,E) 
and  G = (V  ,E  ),  is  the  graph  G D G = (V  U V ,E  U E ).  In  this  paper 

the  vertices  of  a graph,  G = (V,E),  with  n vertices  will  usually  be 

identified  with  the  integers  1 through  n,  that  is,  V = {1,  . . .,  n}. 
The  adjacency  matrix  of  G is  an  n x n matrix  A,  where  for  i,  j e V, 

!1,  if  (i.j)  e E 

0,  otherwise. 

A directed  graph,  G = (V,E),  consists  of  a finite,  nonempty 
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set  V of  vertices  and  a set  E of  ordered  pairs,  (x,y),  of  elements  of  V, 
called  edges.  If  (x,y)  e E,  then  x is  the  tail  of  (x,y),  y is  the  head 
of  (x,y),  and  (x,y)  leaves  x and  enters  y.  The  graph  G is  simple  if 
(x,x)  t E for  all  x e V.  Subgraphs  and  union  of  graphs  are  defined 
for  directed  graphs  as  for  undirected  graphs.  A path  P in  G is  a sub- 
graph P = (Vp.Ep)  of  G where  Vp  = {x-j,  . . .,  xk>  and 
Ep  = { (x^  ,x  ._j_i ) | i * 1,  ....  k - 1};  P is  a path  from  x-|  to  xk  and 

the  length  of  P is  k - 1.  We  will  sometimes  represent  P as  a sequence 

xl  -*■  x2  • • . -*■  x^.  The  path  P is  simple  if  x^  ? Xj  for  i,  j e {1, 

. . .,  k};  P is  a cycle  if  x-j  = xk  and  the  path  x^  Xg  -*■  . . . -+  xk_^ 

is  a simple  path.  If  G = (V,E),  where  V = {1 , . . .,  n},  the  adjacency 
matrix  of  G is  the  n x n matrix  A,  where  for  i,  j e V 

A(i,j)  = 

An  undirected  (directed)  graph  G = (V,E)  is  connected  (strongly 
connected)  if  for  all  i,  j e V there  is  a path  in  G joining  i and  j (from 
i to  j).  A connected  component  (strongly  connected  component)  of  G is 
a maximal  connected  (strongly  connected)  subgraph  of  G. 

A forest  is  an  undirected  graph  which  contains  no  cycles.  A 
connected  forest  is  a tree.  A rooted  tree,  (T,r),  is  a tree  T * ( V , E) 
with  a distinguished  vertex  r e V.  If  1 e V and  i f r,  the  father  of  i 
in  (T,r)  is  the  vertex  j e V which  is  adjacent  to  i on  the  shortest 
path  in  T joining  i and  r.  Then  i is  the  son  of  j.  If  j lies  on  the 

i i 

l. 


rl,  if  (i.j)  £ E 


lO,  otherwise. 
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shortest  path  in  T joining  1 and  r,  then  j is  an  ancestor  of  i and  i 
is  a descendant  of  j.  If  i t j,  j is  a proper  ancestor  of  i and  i is 
a proper  descendant  of  j . In  this  paper,  a directed  rooted  tree,  (T,r), 
is  defined  to  be  a directed  graph  T = (V,E)  with  a distinguished  ver- 
tex r e V,  in  which  there  is  no  edge  of  E leaving  r and  for  each  i e V 
with  i t r,  there  is  a unique  path  in  T from  i to  r.2  If  (i,j)  e E,  j 
is  the  father  of  i and  i is  the  soji  of  j.  If  there  is  a path  in  T from 
i to  j then  i is  a descendant  of  j (proper  descendant  if  i j)  and  j 

i s an  ancestor  of  i ( pro Rer  ancestor  if  i t j ) . A directed  forest  i s a 

/ 

union  of  directed  rooted  trees  with  pairwise  disjoint  vertex  sets.  Let 
(T,r)  be  a rooted  tree  or  a directed  rooted  tree,  where  T = (V,E).  For  i, 
j e V,  the  youngest  common  ancestor  of  i and  j in  T is  the  vertex 
k e V such  that  k is  an  ancestor  of  i and  j and  if  v e V is  an  ancestor 
of  i and  j,  then  v is  an  ancestor  of  k. 

Let  R be  a binary  relation  on  a set  S.  Then  R is  a total 
order  (or  linear  order)  of  S if  R is  reflexive,  antisymmetric,  and 
transitive  and  for  all  x,  y e S,  either  x.  R y or  y R x.  The  minimum 
of  S with  respect  to  R,  written  minR  S,  is  the  element  x e S such  that 
x R y for  all  y e S.  The  maximum  of  S,  maxR  S,  is  defined  analogously. 

Let  1R  be  the  set  of  real  numbers  and  Z,  the  integers.  The 
ceiling  function  is  a function  f : IR  -*■  Z,  where 

f(x)  (written  fxl)  = min  (z  e Z | x < z}. 


2Tarjan  calls  this  an  anti-tree.  In  his  definition  of  directed 
rooted  tree,  the  direction  of  each  edge  is  reversed. 


The  floor  function  is  a function  g : IR  •+  Z,  where 


g(x)  (written  LxJ)  = max  {z  c Z | z < x}. 

Inequalities  involving  logarithms  and  the  ceiling  function 
are  used  throughout  this  paper.  The  following  results  will  be  used 
casually,  when  they  are  needed. 

1.  There  is  a constant  c such  that 
Txl  < cx  for  all  x > 1 

2.  Let  k be  any  number.  Then  there  is  a constant  c such 
that  x + k < cx  for  all  x Z 1 . 

Other  definitions  will  be  made  throughout  the  paper  as  the 
need  arises. 

2.2  Basic  Algorithms 

In  this  paper,  parallel  algorithms  are  sought  for  various 
graph  problems.  The  strategy  used  to  develop  an  algorithm  for  a given 
problem  is  to  devise  a technique  which  reduces  the  solution  of  the 
problem  to  the  solution  of  a sequence  of  problems  for  which  efficient 
parallel  algorithms  are  known.  In  anticipation  of  such  later  use,  some 
problems  for  which  parallel  algorithms  have  been  developed  are  dis- 
cussed below. 

Finding  the  Minimum  of  n Numbers 

Let  S be  a set  of  n elements  and  let  R be  a total  order  of 
the  set  S.  Assume  that  the  relation  R is  such  that  each  processor  of 


I 


• 
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the  unbounded  parallel  model  can  calculate  minR  {x,y},  for  x,  y z S, 
in  c time  units,  for  some  constant  c.  (Although  R will  most  often 
refer  to  the  usual  ordering  on  the  integers,  in  some  cases  S will 
represent  a set  of  vertices  of  a graph  and  R will  be  a relation  on 
vertices.)  For  simplicity,  assume  n is  a power  of  two.  The  minimum 
element  of  S can  be  found  by  setting  up  a tournament  in  the  following 
way.  In  the  first  round,  the  elements  of  S are  paired  off  and  the 
minimum  element  of  each  pair  is  called  the  winner.  In  the  second 
round,  these  winners  are  paired  off  for  new  matches  and  the  winners  of 
these  matches  are  determined  as  before.  The  process  continues  until 
there  is  only  one  element  which  has  been  a winner  after  every  round. 
This  element  is  the  minimum  element  of  S.  Since  n/2t  winners  remain 
after  round  t,  log  n matches  must  be  carried  out  before  a single  winner 
is  obtained. 

ALGORITHM  MIN  (1)  below  describes  how  this  procedure  is 
implemented  on  the  unbounded  parallel  model  of  computation  described 
in  Section  1 .1 . 

ALGORITHM  MIN  (1) 

Assume  that  n elements  are  stored  in  a 1 x n array  AQ. 

Find  min  {AQ(1 ) . ...»  AQ(n)}  in  the  following  way. 

1.  Let  t = 0. 

2.  If  t * log  n,  stop  the  computation,  having  found  At ( 1 ) , the  mini- 
mum of  AQ(1),  . . .,  AQ(n).  Otherwise,  increment  t by  1. 

3.  Partition  (At_1  (1 ) , ....  Atl  (n/2t_1  )>  into  n/2*  sets,  S^, 


:(n/2t) 


. SjM/t  , of  two  elements  each.  Assign  one  processor  to  each 


set  to  compute  At(i),  a minimum  of  the  two  elements  of  s| 
Then  return  to  step  2. 


(0 


The  time  and  processor  requirements  for  each  step  are  as 
follows.  In  step  1,  one  processor  is  assigned  to  store  a number  in  the 
memory.  This  takes  at  most  1 time  unit.  In  step  2,  one  processor  is 
assigned  to  test  whether  or  not  t = log  n.  Depending  on  the  result, 
the  processor  may  stop  and  await  further  instructions  or  it  may  incre- 
ment the  content  of  a memory  location  by  1 . In  either  case,  step  2 
takes  at  most  a constant  number  of  time  units.  In  step  3,  n/2^  pro- 
cessors are  used,  one  for  each  of  the  n/2*"  sets.  Since  it  was  assumed 
that  a single  processor  can  find  the  minimum  of  two  elements  in  c time 
units,  step  3 is  executed  with  n/2t  processors  in  c time  units. 

Since  step  1 is  performed  only  once,  step  2,  log  n + 1 times, 
and  step  3,  log  n times,  the  total  time  used  is 

I 

1 + c (log  n + 1)  + clog  n = 0(log  n). 

The  maximum  number  of  processors  in  use  during  any  unit  of  time  is  n/2, 
which  occurs  in  the  first  iteration  of  step  3.  Since  these  processors 
can  be  reassigned  for  later  use,  n/2  processors  suffice  to  execute  the 
entire  algorithm.  In  the  case  where  n is  not  necessarily  a power  of 
two,  the  procedure  above  can  be  modified  to  find  the  minimum  of  n ele- 
ments in  time  0(log  n)  with  Tn/21  processors. 

There  is  a way  to  solve  this  problem  with  asymptotically 
fewer  processors.  ALGORITHM  MIN  (2)  below  is  a modification  of 
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ALGORITHM  MIN  (1)  and  can  be  used  to  find  the  minimum  of  n elements  in 
time  0(log  n)  using  only  Tn/log  nl  processors.  Note  first  that  a single 
processor  can  be  used  to  find  the  minimum  of  k elements,  x(l),  . . x(k), 

I I 

in  c • (k-1)  time  units  for  some  constant  c . Let  m(l)  = 
min  {x(l),  x(2)}.  For  t=2,  . . k-1,  let 

m(t)  = min  (m(t-l),  x(t+l)}. 


Then 


m(k-l)  = min  (x(l),  ....  x(k)}. 

Since  for  t=l , . . k-1,  m(t)  can  be  calculated  from  m(t-l)  and  x(t+l) 

• I 

in  c time  units,  for  some  constant  c , m(k-l)  can  be  computed  from 

I 

x(l),  . . . , x ( k ) in  time  c • (k-1). 

ALGORITHM  MIN  (2) 

Assume  that  n elements  are  stored  in  a 1 x n array  A. 

Find  min  (A(l),  . . . , A(n)}  in  the  following  way. 

1.  Partition  the  n elements  into  Tn/log  nl  sets  S, , . . . , Sr  .. 

1 I n/ log  n l 

of  at  most  Tlog  nl  elements  each.  Assign  one  processor  to  each  set, 

S. , to  compute  m(i),  a minimum  element  of  S. . 

2.  Use  ALGORITHM  MIN  (1)  to  find  m,  the  minimum  of  the  Tn/log  nl 
elements,  m(l),  . . . , m(rn/log  nl).  Then  m = min  {A ( 1 ) , . . . , A(n)}. 

In  step  1,  each  of  Tn/log  nl  processors  must  find  the  minimum 
of  at  most  flog  nl  numbers,  which  can  be  done  in  time  at  most 


c • (flog  nl-1)  as  in  the  discussion  preceding  the  algorithm.  Step  2 
is  an  execution  of  ALGORITHM  MIN  (1)  on  a set  with  Tn/lcg  nl  elements 
which  has  been  shown  to  use  time  0( 1 og  Tn/log  nl)  and  ^{n/log  n] 
processors.  Thus  the  total  time  required  for  ALGORITHM  MIN  (2)  is 

I 

bounded  above  by  c (flog  nl-1)  + c-jlog  Tn/log  nl  = 0(log  n).  The  maxi- 
mum number  of  processors  used  is  Tn/log  nl,  which  occurs  in  step  1. 

Note  that  these  algorithms  can  be  modified  to  find  the  maximum 
of  n numbers  in  time  0(log  n)  with  Tn/log  nl  processors. 

Determining  Whether  a Binary  Vector  Contains  a One 

Let  v = [v(l),  ....  v(n)]  be  an  n-dimensional  vector  where 
v(i)  e {0,1}  for  i=l,  . . . , n.  It  will  be  necessary  in  some  later 
algorithms  to  determine  if  such  a vector  contains  a one.  This  can  be 
done  in  time  0( log  n)  with  \n/ log  nl  processors  using  ALGORITHM  MIN  (2) 
in  the  following  way. 

1.  Let  "5"  be  the  usual  ordering  of  the  integers.  Find  M = max^  (v(l), 

. . . , v(n)>  using  ALGORITHM  MIN  (2). 

2.  The  vector  v contains  a 1 if  and  only  if  M = 1. 

Determining  Transitive  Closure  of  a Boolean  Matrix 

Let  G = (V,E)  be  a directed  graph.  The  transitive  closure  of  G 

* * — ■>  * 

is  the  graph  G = (V, E ) where  for  x,  y e V,  (x,y)  e E if  and  only  if 

there  is  a path  from  i to  j in  G of  length  at  least  0.  Let  A be  an 

n x n Boolean  matrix.  Then  A is  the  adjacency  matrix  of  the  graph 
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Ga  = (Va,Ea),  where  VA  = {1,  . . . , n}  and 

Ea  = { ( i , j ) 1 1 , j C V and  A(i,j)  = 1}. 

★ 

The  transitive  closure  of  A is  A , the  adjacency  matrix  of  the  graph 
* 

GA* 

Let  B and  C be  nxn  Boolean  matrices. 

Define  the  sum  of  B and  C to  be  the  nxn  matrix  B + C,  • 

where 


(B+C)(1  ,j)  - max  (B(i,j),  C(i,j)}. 

Define  the  product  of  B and  C as  the  nxn  matrix  BC,  where 

(BC)(i,j)  = max  {min  (B(i,k),  C(k,j)}  | k = 1,  . . . , n}. 

The  following  theorem  gives  a way  of  calculating  A from  A. 

Theorem  1 . Let  G 3 (V,E)  be  a graph  where  V = {1 n}.  Let  A 

be  the  adjacency  matrix  of  G and  I,  the  nxn  identity  matrix.  Then  for 
any  nonnegative  integer  k,  (A+I)  (i,j)  =1  if  and  only  if  there  is  a 
path  In  G from  1 to  j of  length  at  most  k,  where  the  addition  and 
multiplication  are  as  defined  above. 

Proof:  If  t is  a nonnegative  Integer,  then  At(i,j)  * 1 if 
and  only  If  there  Is  a path  in  G,  of  length  t,  from  1 to  j.  (See  [9], 
p.  151.)  Since  (A+I)**  * Ak  + Ak_1  + . . . + A+I,  the  theorem  follows. 
QED 
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Since  the  length  of  the  shortest  path  in  G between  any  two 

★ n-i 

vertices  is  at  most  n-1,  this  implies  that  A = (A+I)  and  further, 

* k * 2f1og(n"1)J  * 

that  A = A for  k £ n-1.  In  particular,  A = A and  thus  A 

can  be  obtained  by  squaring  the  matrix.  A,  riog(n-l)l  times.  This  fact 
is  the  basis  of  a parallel  algorithm,  due  to  Hirschberg  [11],  which  com- 

p 

putes  the  transitive  closure  of  a Boolean  matrix  in  time  0 ( 1 og  n)  using 

3 2 

n processors.  The  number  of  processors  <?an  be  reduced  to  n Tn/log  nl 

if  ALGORITHM  MIN  (2)  is  used.  Recently,  Chandra  [6]  has  developed  a 

parallel  algorithm  which  finds  the  product  of  two  nxn  matrices  in  time 

0(log  n)  using  n1og  7 processors.  It  is  based  on  Strassen's  algorithm 

★ 

[27]  for  sequential  matrix  multiplication.  If  A is  computed  by  squaring 
the  matrix.  A,  T 1 og (n-1 ) 1 times,  where  the  squaring  is  done  by  using 
Chandra's  algorithm,  then  A can  be  computed  from  A in  time  0(log  n) 
using  only  (log  n)nlog  7 processors.  If  A is  a symmetric  nxn  Boolean 
matrix,  the  transitive  closure  can  be  found  in  time  0(log  n)  using  only 
nrn/log  nl  processors.  This  is  a consequence  of  Hirschberg's  algorithm 
[11]  for  finding  the  connected  components  of  an  undirected  graph  and 
will  be  discussed  in  detail  in  Section  3.3. 

Sorting 

Let  R be  a total  order  of  a set  S.  An  array  X(l),  . . . , 

X (n ) of  elements  of  S is  sorted  if  X(i)RX(j)  for  i = 1 , . . . , n-1 . 

A sorting  algorithm,  given  a lxn  array  X of  elements  X(l),  . . . , 

X(n)  of  S,  computes  a lxn  array  Y,  where  Y(l),  . . . , Y(n)  consists  of 
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the  elements  X(l),  . . . , X (n)  in  sorted  form.  Preparata  [18]  has 
developed  an  algorithm  to  do  this  in  time  clogn  with  nTlog  n]  processors, 
for  some  constant  c. 

Although  they  will  not  be  used  here,  it  is  interesting  to  note 
that  classes  of  sorting  algorithms  have  been  found  by  Preparata  [18] 
and  Hirschberg  [12]  which,  for  any  given  a>o,  sort  n numbers  in  time 
(c/a)log  n with  n processors.  Those  of  Preparata  have  the  practical 
advantage  that  there  is  no  time  unit  during  which  two  processors  are 
requested  to  read  the  content  of  the  same  memory  location. 


I 
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Chapter  3 
SPANNING  TREES 

In  Section  3.2  of  this  chapter,  a parallel  algorithm  is  pre- 
sented which  finds  a minimum  spanning  tree  of  a connected,  undirected, 

2 

weighted  graph  with  n vertices  in  time  0( 1 og  n),  using  nTn/log  n]  pro- 
cessors. It  is  shown  in  Section  3.3  that  this  algorithm  can  be  modified 
to  find  a spanning  tree,  the  connected  components,  and  the  transitive 
closure  of  a connected,  undirected  graph  within  the  same  time  and  pro- 
cessor bounds.  An  algorithm  for  finding  a directed  spanning  tree  of  a 
connected,  undirected  graph  is  presented  in  Section  3.4. 

The  algorithm  presented  here  for  finding  a minimum  spanning 
tree  uses  techniques  from  [11]  and  [32],  and  it  appears  in  [26]. 

3. 1 Definitions 

A tree  was  defined  in  Section  2.2.  Note  that  a 
tree  with  n vertices  has  n-1  edges.  Let  G = (V,E)  be  a connected, 

I 

undirected  graph.  A spanning  tree  for  G is  a tree  T = (V,E  ) where 

I 

E _ E.  G is  a weighted  graph  if  there  is  a function  w : E •*>  IR . The 
weight  matrix  of  G Is  the  nxn  matrix  W,  where  for  1 , j e V, 

/w(1.j),  if  (i.j)  e E 

W(1,j)  A 

I®,  otherwise. 

(Here,  « is  a symbol  which  has  the  property  that  x < « for  every  real 
number  x.) 


A minimum  spanning  tree  (MST)  of  the  weighted  graph  G is  a spanning  tree 


I 

T = (V,E  ) of  G such  that  £ , w(e)  is  minimal  among  all  spanning  trees 

eeE 

of  G. 

3.2  The  Minimum  Spanning  Tree  Algorithm 

Let  G = (V,E)  be  a connected,  undirected,  weighted  graph,  with 

weight  function  w.  Assume  that  the  vertices  of  G are  labeled  by  the 

integers  1 through  n,  that  is,  V = {1,2,  . . . , n}.  Let  W be  the  weight 

matrix  of  G.  An  algorithm  is  presented  here  which  finds,  from  W,  the 

adjacency  matrix,  M,  of  a MST  of  G.  The  construction  used  is  described 

below.  It  will  be  shown  that  it  is  possible  to  carry  out  the  construction, 

that  the  construction  does  result  in  a MST  of  G,  and  that  the  construction 

2 

can  be  executed  by  the  unbounded  parallel  model  in  time  0(log  n)  with 
nTn/log  nl  processors. 

The  notation  used  in  the  construction  below  will  be  used 
through  the  remainder  of  Chapter  3. 

Construction  of  a Minimum  Spanning  Tree 

A MST  of  G will  be  constructed  inductively  as  follows. 

1.  For  each  i e V,  let  dQ(i)  = i.  Let  Gq  be  the  graph  (V,<J>). 

2.  If  dQ(i ) f dQ(j)  for  some  pair  i , j e V,  it  will  be  shown  that  it  is 
possible  to  choose,  for  each  i e V,  an  edge  E^(i)  of  G,  incident  with  i, 
such  that  (a)  w(E-|(i))  is  minimal  among  all  edges  of  G incident  with  i 
and  (b)  the  undirected  graph  G^  = (V,T-|),  where  T-j  = l'E^(i)  | i e V}, 
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contains  no  cycles.  Define  a relation  "=^"  on  V by  i=^j  if  and  only 
if  there  is  a path  in  joining  i and  j.  Then  is  an  equivalence 
relation  on  V and  therefore  partitions  V into  equivalence  classes. 

Choose  one  vertex  in  each  equivalence  class  as  a representative.  Let 
d^(i)  be  the  representative  of  the  equivalence  class  of  = ^ which  contains 
i.  For  i e V,  let 

[ill  = (j  e V | d] ( i ) = d] ( j ) } . 

Then  the  set  * {[i]-j  | d^(i)  = i}  is  the  collection  of  equivalence 
classes  of  V under  the  relation  =^. 

3.  Assume  inductively  that  for  some  t £ 1,  Gt>  T^t  and,  for  i = 1, 

. . . , n,  dt(i)  and  [i]^  have  all  been  defined.  If  dfc(i)  t dt(j)  for 
some  pair  i,  j c V,  it  will  be  shown  that  it  is  possible  to  choose,  for 
each  i e V with  dt(i)  s i , an  edge  Et+1(i)  such  that  (a)  w(Et+1(i))  is 
minimal  among  all  edges  of  G which  join  a vertex  in  [i]fc  to  a vertex  in 
another  equivalence  class  in  and  (b)  the  undirected  graph  Gt+.j  = 
(V,Tt+1),  where 

Tt+1  = Tt  U {Et+l^  I J e V and  dt(j)  = j} 

contains  no  cycles.  Define  a relation  =t+^  on  V by  i =t+^  j if  and 
only  if  there  is  a path  in  Gt+^  joining  1 and  j.  Then  is  an 
equivalence  relation  on  V and  therefore  partitions  V into  equivalence 
classes.  Choose  one  vertex  in  each  equivalence  class  as  a representative. 
For  i e V,  let  dt+^(i)  be  the  representative  of  the  equivalence  class 


of  =t+.|  which  contains  i.  For  i e V let 

[i]t+1  = (j  e V | dt+1(i)  = dt+](j)}. 

Then  the  set 

St+1  3 *^t+l  I dt+l^  = 

is  the  collection  of  equivalence  classes  oV  V under  the  relation  • 

It  will  be  shown  in  Theorem  2 that  for  each  t * 0 such  that 
dt(u)  f dj.(v)  for  some  u,  v e V,  there  is  for  each  i e V,  with  d^(i)  ^ i, 
an  edge  Et+1(i)  of  6 which  satisfies  the  conditions  stated  in  the  con- 

•ft 

struction.  Let  t be  the  smallest  index  for  which  dt*(i)  = dt*(j)  for 
all  i,  j e V.  Theorem  3 shows  that  the  graph  G^*  is  a MST  of  G. 

Assume  that  G has  more  than  one  vertex.  For  i e V,  let 

m.j(i)  = min  {w(i  ,j)  | j e V}. 

Since  G is  connected,  m^(i)  < «>.  Define  a function  : V-^V  by 

C,  (i)  = min  {j  | j e V and  w(i,j)  = m^i)}. 

Let  G(C.j)  be  the  directed  graph  (V,E  ) where  E = {(i  ,C-|  (i ) ) | i e V}. 
Figure  1 shows  G(C-j)  for  a given  weighted  graph  G. 

Lemma  1 . Every  cycle  of  G(C^)  has  length  two. 

Proof : Let  v^  -*■  v^  . . . ■*•  v^  be  a cycle  of  G(C^)  where 
C-|(vr)  = vr+i  for  1 * r * k - 1 and  v1  = v^.  Then  2 < k < n.  By 


mmmm 
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> > > > 

w(vrv2)  1 w(vk  l,vk)  - w((vk_2 »vk_7 ) < . . . < w(v1,v2). 

Thus  all  edges  in  the  cycle  have  equal  weight.  Again  by  definition 
of  C-j , we  have  the  following  inequalities.  If  k is  even, 

V - V V ^ 

2 k ' k-2 2 

which  means  k = 2.  If  k is  odd, 

v2  S vk  s . . . * v3  i v1  * vkl  < . . . < v2 

which  is  impossible  if  k > 2.  QED 

Theorem  2.  Using  the  notation  in  the  construction,  for  each  t * 0,  such 

that  dt(u)  t dt(v ) for  some  u,  v e V,  and  for  each  i e V,  with  ( i ) = i, 

there  is  an  edge  Et+i(i)  of  G satisfying  conditions  (a)  and  (b)  in  the 
construction. 

Proof:  For  t = 0 and  i e V,  let  E^(i)  = (i,C^(i))  where  Cj 
is  the  function  defined  preceding  Lemma  1.  Then  by  definition  of  C-| , 
Ej(i)  is  an  edge  of  G incident  with  i and  w(E-j(i))  is  minimal  over  all 
edges  of  G incident  with  i.  Let  G-j  = (V,T^)  be  the  undirected  graph 
where  T^  = {E1 (i)  | i e V}.  If  G^  contains  a cycle  v^,  . . . , vk  then 

V1  = V k 38  4’  MV  = v2  or  vk-l  and  for  r = 2»  • • •»  k-1  * MV  = 

vr+1  or  vr_.| . Without  loss  of  generality,  assume  Cj(vj)  = v2-  If 
C-j(vr)  = v^  for  r ■ 1,  . . . k-1  then  v^  v2  ■*  • • • vk  is  a cycle 
of  length  at  least  3 of  the  directed  graph  G(C^),  contradicting  Lemma  1. 
Otherwise,  there  is  an  index  s,  2 * s * k,  such  that  C^(v$)  = v$_^ . 

But  then  for  s * r S k,  ( vr ) must  be  v by  definition  of  G^ . 


[ 

f • 
4. 

[ 
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This  is  a contradiction,  since  ( vk ) = C.|(v.|)  = v2  ^ vk-l*  Tflus 

contains  no  cycles.  (See  Figure  2(a).) 

Assume  inductively  that  for  some  t > 0,  such  that  dt(u)  t 
d^(v)  for  some  u,  v e V,  E^(i)  has  been  chosen,  for  all  i e V with 
dt(i)  = i , to  satisfy  (a)  and  (b)  of  the  construction.  Then  the  graph 

I I I 

Gt  = (V»Tt)  contains  no  cycles.  Let  G = (V  ,E  ) be  an  undirected, 
weighted  graph  whose  vertices  are  the  connected  components  of  G^.  Each 
connected  component  of  Gt  corresponds  to  an  equivalence  class  [i]t  in 
St»  Label  the  connected  component  of  G^  corresponding  to  [i]t  by 
d t ( i ) . Then  V = {i  e V | dt(i)  = i}.  For  each  i,  j e V , let  (i,j) 

f 

be  an  edge  in  E if  and  only  if  there  is  an  edge  of  G joining  a vertex 

I 

of  [i]t  to  a vertex  in  [j]t.  Define  a weight  function  w for  (i,j)  e E 

I * 

by  w (i,j)  = min  (w(u,v)  | u e [i]^,  v e [j]t>.  G is  connected  since 
G is.  (See  Figure  2(b). ) 

» 

Define  a function  m^  on  V by 

I I 

mt+i ( 1 ) s min  O »j)  I o £ v }• 

Since  Gt  is  connected  and  contains  more  than  one  vertex,  mt+^(i)  f «°. 
Define  Ct+^  : V -*•  V by 

I 

Ct+](i)  * min  (j  | w (i.j)  = mt+1(i)L 

I I I 

Then  the  undirected  graph  Gt+1  = (V  ,Tt+.|),  where 


- ' - 
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Tt+1  = {(i’Ct+1(i))  I 1 e V }’ 

contains  no  cycles.  This  follows  as  in  the  proof  that  contains  no 

I 

cycles  by  replacing  C-j  by  Ct+1  and  by  Gt+1- 

I 

For  each  i e V , let  Et+.j(i)  be  an  edge  of  G of  weight 

I 

w (i,Ct+1(i))  which  joins  a vertex  of  [i]^  to  a vertex  of  [Ct+1(i)]t> 
with  the  stipulation  that  if  Ct+^  (C ( i ) ) = i,  then  Et+1(i)  = 

Et+i(ct+i(1»-  Let  Gt*:  ‘ <V-W-  "here 
Tt+i  ■ Tt  "W”  1 dt(i)  5 

I 

(See  Figure  3.)  Since  Gt+^  is  acyclic  if  and  only  if  G^  and  Gfc+^  are 
acyclic,  by  the  induction  hypothesis,  Gt+^  is  acyclic.  Further,  Et+-j(i) 
is  an  edge  of  minimum  weight  over  all  edges  which  join  a vertex  of 
equivalence  class  [i]t  to  a vertex  in  another  equivalence  class  in  S 
QED 

Theorem  3.  Let  t be  the  smallest  index  such  that  dt*(i)  = dt*(j)  for 

all  i,  j e V.  Then  Gt*  is  a MST  of  G. 

★ 

Proof:  If  t =0,  then  G has  only  one  vertex  and  the  theorem 
is  true.  Thus,  assume  t > 0.  If  dt*(i)  = dt*(j)  for  all  i,  j e V, 
then  St*  has  only  one  equivalence  class,  so  Gt*  is  connected.  By 
Theorem  2,  Gt*  contains  no  cycles.  Thus  Gt*  is  a spanning  tree  of  G. 

It  remains  to  verify  minimality. 

It  will  be  shown  by  induction  on  t that  for  each  t,  with 

★ 

1 s t £ t , the  edges  of  G^  are  contained  in  a MST  of  G.  Let  e be  an 
edge  of  G of  minimum  weight.  Then  there  is  a MST  of  G which  contains 


(b)  The  graph  G2  if  Et+1  (dt(T ) ) = £t+i(dt(6))  = 0»5) 


(c)  The  graph  G2  if  Et+1(dt(l))  = Et+1  (dt( 6) ) * C1’4) 


Figure  3.  Three  possible  choices  for  G2  from  the  Graph  G of  Figure  1(a) 


wmmm 
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e [21].  By  definition,  G-j  = (V,T^)  must  contain  an  edge  of  weight  w(e), 
thus  at  least  one  edge  of  is  contained  in  a MST  of  G.  Let  T be  a 
maximal  subset  of  T-|  with  the  property  that  there  is  some  MST  of  G con- 
taining the  edges  of  T.  Let  H = (V,E(H))  be  a MST  of  G with  T £E(H). 

» * 

If  T ? T.| , let  e be  an  edge  of  T^  which  is  not  in  E(H).  Since  e e T^ , 

l I 

there  is  a vertex  v^  e V,  incident  with  e , such  that  e is  an  edge  of 

f 

minimum  weight  among  all  edges  of  G incident  with  i.  Adding  e to 

I 

H creates  a cycle  v^,  . . . , v^  where  = v^  and  e = (vk_-j  »vk ) • 
Then  (^(v^)  3 vk_j.  Let  r be  the  smallest  index  for  which  (vr>vr+1)  t T-j 
G^  contains  no  cycles,  1 < r < k-1 . Then  for  2 S s £ r,  C-|(vs)  = vs_]- 
Thus 

w(e  ) * w(v.| , v 2 ) < • . . S w(vr_rvr)  < w(vr,vr+1). 

then 

h'  = (V ,E(H)  (e ' } - {(vr,vr+1)}) 

I 

is  a MST  of  G containing  T {e  },  contradicting  the  maximality  of  T. 

Assume  inductively  that  for  ?ome  til,  the  edges  of  Gt  are 

★ 

contained  in  a MST  of  G.  If  G^  is  connected,  t = t and  G^.  is  a MST  of 

I I I 

G.  Otherwise,  let  G = (V  ,E  ) be  the  undirected,  weighted  graph  de- 
fined in  the  proof  of  Theorem  2,  where  the  set 

v'  = {i  e V | dt(i)  = i} 

I I 

represents  the  set  of  connected  components  of  Gt-  Let  Ct+1  : V "*■  V 
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be  defined  as  in  the  proof  of  Theorem  2.  Then  the  undirected  graph 

Gt+1  = <V’W’  where 

T^+1  - ((i,Ct+1(i))  | i e v'}f 

I 

is  a MST  of  G . This  follows  as  in  the  case  where  t = 1 by  replacing 

I I 

G by  G , by  C and  G^  by  Gt+^ . Recall  that 

Tw  * Tt u !Ew(i)  i dt<<>  ■ 11 

where  Et+j(i)  is  an  edge  of  G joining  a vertex  of  [i]t  to  a vertex  of 

I 

[Ct+1(i)]t  and  w(Et+1(i))  = w (i,Ct+1(i)).  Then  the  edges  in  Tt+1  are 
contained  in  a MST  of  G if  and  only  if  the  edges  in  T^  are  contained  in 
a MST  of  G.  (This  can  be  proven  by  an  argument  similar  to  the  one  used 
to  prove  that  the  edges  of  G-|  are  contained  in  a MST  of  G.)  By  the 
induction  hypothesis,  the  edges  in  T^  are  contained  in  a MST  of  G,  thus, 
the  edges  of  Tfc+1  are  contained  in  a MST  of  G. 

Thus,  the  edges  of  Gt*  are  contained  in  a MST  of  G.  Since 
Gt*  is  acyclic  and  connected,  Gt*  is  a MST  of  G.  QED 

It  follows  ^rom  Lemma  2 below  that  t ^ Hog  nl. 

Lemma  2:  For  t * 1,  St  contains  at  most  n/2*  equivalence  classes. 

Proof:  If  t = 1,  each  equivalence  class  of  S^  has  at  least 
2 vertices,  so  |S-||  £ n/2  and  the  lemma  is  true.  Assume  for  some  t ? 1 
that  |St|  s n/2t.  Let  i e V with  dt(i)  t i.  Edge  Et+^(i)  joins  a vertex 
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u of  [i]t  with  a vertex  v in  some  distinct  equivalence  class  [j]t  of 
St>  By  definition  of  =t,  every  pair  of  vertices  of  [i]t  can  be  joined 
by  a path  in  G^  and  similarly  for  every  pair  of  vertices  in  [j]^.  Then 
every  pair  of  vertices  in  [i ] t U [j]t  can  be  joined  by  a path  in  Gt+1 , 
since  Et+1(j)  e Tt+1>  Thus  [i]t  U [j ] t is  contained  in  a single  equivalence 
class  of  St+.| . Since  is  a refinement  of  =t+-j » This  shows  that  every 
equivalence  class  in  is  the  union  of  at  least  two  equivalence  classes 
in  St,  thus  |St+1|  S |St|/2  < n/2t+1.  QED 

It  is  shown  below  how  to  implement  the  construction  of  G^*  from 
G on  the  unbounded  parallel  model  of  computation.  Initially,  the  weight 
matrix,  W,  of  G is  stored  in  the  memory  and  at  the  end  of  the  algorithm, 
the  weight  matrix,  M,  of  Gfc*  is  stored  in  the  memory.  The  entire  algo- 
rithm is  presented  and  analyzed  at  the  end  of  this  section. 

Construction  of  G and  d„ 

o o 

Since  G = (V,<J>)  and  d (i)  = i for  all  i e V,  the  following 
o o 

steps  compute  dQ  and  the  adjacency  matrix  of  GQ. 

1.  For  each  pair  i,  j e V,  let  M(i,j)  = 0.  Then  M is  the  weight 

matrix  of  G . 

o 

2.  For  each  i e V let  dQ(i)  = i. 

Construction  of  G1  and  d1 

If  dQ(i)  = d0(j)  for  all  i,  j t V then  G consists  of  a single 
vertex  and  Gq  is  a spanning  tree  of  G.  Otherwise,  it  was  shown  in 
Theorem  2 that  G^  can  be  calculated  in  the  following  way. 
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3.  For  each  i e V 

(a)  let  m^i)  = min  {W ( i , j ) | j e V} 

(b)  let  C^(i)  = min  {j  | W(i,j)  = m^ ( i ) } . 

Then  = (V,T^)  where  = {(1»Cj(i))  | i e V}.  In  the  con- 
struction of  G2.  some  operations  are  performed  only  for  i e V such  that 
d^(i)  = i.  Thus  d^  must  be  computed.  That  is,  for  each  equivalence 
class  [j],  e S-|  a vertex  of  [j]-|  must  be  chosen  as  a representati ve. 
Each  equivalence  class  in  corresponds  to  a connected  component  K of 
G^.  It  is  shown  in  Lemma  3 that  each  connected  component  K of  G^  con- 
tains exactly  one  pair  of  vertices  iK>  such  that  C-j  ( i K ) = and 
Ci(JK)  = i^.  Then  min  {i^,  j^}  is  chosen  to  be  the  representative  of 
the  equivalence  class  of  corresponding  to  K.  In  the  example  of 
Figure  1,  d^(i)  = 1 for  i = 1,2,3  and  d^(j)  = 6 for  j = 4,  . . . , 8. 

Lemma  3.  Let  K be  a connected  component  of  G^.  Then  K contains  exactly 
one  pair  of  vertices  i , j such  that  C^i)  = j and  C^j)  = i. 

Proof:  For  a vertex  v of  K,  the  vertices  in  the  sequence,  v, 
C-|(v),  C^(C^(v)),  ....  are  all  in  K and  cannot  all  be  distinct.  Then 
there  must  be  a sequence 

Cf(v),  CpV).  ....  C*(v) 

S 

of  distinct  vertices  of  K for  some  r and  s,  where  C^(C^(v))  = C-|(v). 

If  s f r+1 , then  the  cycle 

C^(v)  Cp^v)  C*(v)  dj’(v) 
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is  a cycle  of  length  at  least  3 in  the  directed  graph  G(Cj)  of  Lemma  1, 

P s 

contraaicting  Lemma  1.  Thus  C^(v)  and  C^(v)  are  vertices  of  K such  that 
C1  (C^v))  = Clf+1(v)  = C*(v)  and  C^C^v))  = cj(v). 

I I 

Assume  i , j and  i ,j  are  distinct  pairs  of  vertices  of  K for 

II  II 

which  C.|(i)  = j,  Cj(j)  = i,  C^(i  ) = j and  Cj(j  ) = i . Let  v1 , . . . , 

I I 

v^  be  a path  in  K where  v^  e {i,j}  and  vk  e {i  ,j  } and  for  r = 2,  . . . , 

I I 

k-1 , v^  i {i,j,i  ,j  }.  Then  it  must  be  that  ( v2 ) = v^  and  C-j  (v^-j ) = 
v^.  By  definition  of  G^ , for  s * 2,  . . . , k-1,  if  C^(v$)  = v$  -j , then 
Cl  (vs+l ) = V Thus>  since  C^Vj)  = v1 , C1(v((_1)  = vR_2  f-  vk-  This 
is  a contradiction.  Thus  K has  only  one  pair  of  vertices  i , j with 
C-j (i)  = j and  C1  (j)  * i.  QED 

The  function  will  be  modified  slightly  to  obtain  a function 

I 

where 

/i,  if  i is  a representati ve  of  an  equivalence 
l class  in 

c‘ (i)  = \ 

|C^(i),  otherwise. 

Since  ( C-j  ( i ) ) = i if  and  only  if  i or  C(i)  is  an  equivalence  class 
representative,  the  edges  (i,C^(i))  and  (C^(i),  C^(C^(i)))  of  G1  are 
identical  and 

T-|  = {(1  *C<|(i))  | i e V and  i t Cj(1)}. 

I 

The  steps  below  compute  the  function  and  the  weight  matrix  of  G-| . 
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5.  For  each  i e V,  let 


C-,(i) 


(min  {1  *C-|  (i ) } , if  C^C^i))  = i 


[C-j  (i ) , otherwise. 


I 


I 


6.  For  each  i e V with  C^(i)  t i,  replace  M(i ,C^ (i 

I 

by  W(i,C-j(i)).  Then  M is  the  weight  matrix  of  Gj . 


Lemma  4.  For  each  i e V,  let  0 (i)  = C^(i)  and  for 

Dk-l(Dk-l(i))-  Then  dl(i)  = °riog(n-l)l(i)  for  a11 
Proof:  Let  i e V.  Let  i = vi »v2 » • • •» 

simple  path  in  joining  i and  d^i).  Then  r - n. 

I I 

d,(i)  f vr_i » we  must  have  C^(vr_.j)  = vr  and  thus  C-j 
s = 1,  . . r-1.  Then  for  s = 1,  . . . , r. 


!vs+l * if  s + 1 < r 

i 

vr,  if  s +1  ^ r 

and  for  k > 1 , 

(vs*2k-  ff  s t 2k  < r 

w -} 

(v  If  s * 2k  : r. 


) and  M(C1 (i ) ,i ) 

k > 1,  let  Dk(i) 
i e V. 

vr  = d^i)  be  a 

I 

Since  C-j  (d^  (i )) 
(v,)  - vs+,  for 
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Thus 


Driog(n-l)l(i)  = °riog(n-l)l(vl)  = vr  = dl(l)’ 
since  1 + 2riog(rH)1  > n-1  > r-1 . QED 

I 

Then  can  be  computed  from  in  the  following  way. 

7.  Let  k = 0 and  for  i e V,  let  Dq ( i ) = C-j(i). 

8.  If  k 2 log(n-l),  then  for  i e V let  d^i)  = Dk(i).  Otherwise, 
increment  k by  1 . 

9.  For  each  i e V,  let  DR(1 ) = Dk_i (Dk_1 ( 1 ) ) • Then  return  to  step  8. 
Construction  of  Gt+-j  and  d^  from  and  d^ 

If  dt(i)  = dt(j)  for  all  i ,j  e V,  then  Gt  is  a MST  of  G. 
Otherwise,  it  was  shown  in  Theorem  2 that  Gt+1  can  be  constructed  from 
Gt  and  dt  by  finding,  for  each  i e V such  that  d^(i)  = i , an  edge  E^Ci) 
of  G satisfying  certain  properties.  Then  G^  = (V,Tt+-|)  where 

Tt+1  = Tt  U ^Et+1^^  I dt^  = For  each  1 e V such  that  dt^  = 
define  mt+1 ( i ) , as  in  the  proof  of  Theorem  2,  by 

mt+l(i)  = min  (w(u,v)  | u e [i]t  and  for  some  j e V, 
v e [j]t  and  i f j}. 

Since  G is  connected  and  G^  has  more  than  one  component,  m^+^(i)  < °°. 

For  each  i e V such  that  d^(i)  = i,  define  C^.+i  ( i ) * as  in  the  proof  of 
Theorem  2,  by 
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ct+](i)  = min  < J e V | i j*  j , dt(j)  = j and  for  some 
u e [i ]fc , v e [j]t>  w(u,v)  = mt+-| ( i ) ) • 

Then  by  Theorem  2,  for  each  i e V with  dt(i)  = i,  Et+1(i)  is  an  edge  of 

6 of  weight  mt+^(i)  which  joins  a vertex  in  [i]t  to  a vertex  in 

[Ct+-|(i)]t'  However,  there  may  be  several  edges  of  G with  this  property 

and  any  one  may  be  chosen  as  E^.+j(i).  (See  Figure  3.)  To  have  a precise 
algorithm,  there  must  be  a rule  for  choosing  one  edge  from  among  all 
possible  choices  for  E ( i ) . Assume  ( i i » j -, ) , ....  ( i k , jk ) are  the 
edges  of  G which  satisfy,  for  r = 1 , . . . , k,  e [i 3t , jp  e [Ct+1 (i )3t , 
and  w(ir,jr)  = mt+1(1).  Let 

ht+l  ( 1 ) 3 mtn  I •"  * • • . , k} 

and 


gt+1(i)  55  min  (jp  | r = 1 , . . . , k and  ip  = ht+1(i)}. 

Then  let  Et+1  ( 1 ) = (ht+^(i)»  gt+-j ( i ) ) . In  the  example  of  Figure  3, 

£,(1)  » E, (6)  3 (1,4). 

The  computation  of  Ct+1(i)  and  Et+^(i),  for  each  i c V with 

I I 

d^(i)  3 i,  is  made  precise  below.  The  functions  m^+^ , at+^ , and  9t+^ , 
which  will  be  defined,  are  just  intermediate  steps  in  the  calculation  of 

mt+r  ht+r  9t+l*  and  Ct+T 
12.  For  each  i e V, 

(a)  let  mt+1(i)  3 min  (W(i,j)  | j e V and  dt ( j ) t dt ( i ) > 

(b)  let  at+1(i)  3 min  {dt ( j ) | j e V,  dt(j)  t dt(i)  and 

W(i,j)  * mt+1(i)} 


b 


l 

I 
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13. 


(c) 

let  1 

9tt,d) 

= min  {j  e V | dt(j)  = i 

it+1(i)  and  W(i,j 

mt+l ( 1 ) ^ • 

For  each 

i e 

V such  that  dt(i)  = i 

(a) 

let 

W* 

1 

= min  {mt+1(j)  | dt(j) 

= i.  j £ V} 

(b) 

let 

= min  (at+1(j)  | j e V 

, dt(j)  = i and 

Vi(j')  = mt+l(i)} 

(0 

let 

ht+](i) 

* min  (j  e V | dt(j)  = 

i and  mt+1(j)  = 

mt+1(i)  and  at+1(j)  = 

ct+l (i)> 

(d) 

let 

W> 

3 9t+l(ht+l(1))' 

At 

this 

point,  G 

t+^  has  been  found  and  ■ 

its  weight  matrix 

can  be  constructed.  It  remains  to  calculate  d 


t+1 


Let  Gt+,  » (V  ,Ttt,) 


where  V = {i  e V ) dt(i)  = 1}  and 

Tt+1  = Ui  »Ct+i(i))  | i e V and  dt(i)  = i}. 

I I 

Choose  a vertex  of  V from  each  connected  component  of  as  a representa- 

I I 

tive.  For  each  i e V , let  dt+^(i)  be  the  representative  of  the  connected 

I 

component  of  V containing  i.  The  component  representatives  can  be 

I 

chosen  and  dt+l  can  be  computed  just  as  equivalence  class  representatives 
were  chosen  and  d^  computed  in  steps  5 through  9. 

14.  For  each  i e V with  d-j(i)  = i,  let 


.in  {1,  Ctt,(1)>,  If  Ctt,(Ct+1(l»  = 1 


Cttl(D  = 


. f 


I 

t 


C^.+i (i ) , otherwise. 
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I 

15.  For  each  i e V with  ( i ) ■ i and  Ct+1(i)  f i,  replace 
M(ht+1(i),  gt+1(1))  and  M(gt+1(i),  ht+1(i))  by  W(ht+1(i),  9t+1 ( i ) ) - 
Then  M is  the  weight  matrix  of  Gt+1 • 

16.  Let  k = 0 and  for  i e V with  dfc(i)  = i,  let  DQ^+^(i)  = Ct+^(i). 

I 

17.  If  k - log(n-l),  then  for  each  i e V with  d ^ ( i ) = i,  let  dt+^(i)  3 
D^+^(i)  and  go  on  to  step  19.  Otherwise,  increment  k by  1 . 

18.  For  each  i e V with  d^fi)  = i,  let 

°kt+1)(i)  = °k-!1)(Dk-!1)(i))* 

Then  return  to  step  16. 

Let  S be  an  equivalence  class  in  S^-j.  Then  S is  the  union  of 
some  equivalence  classes  in  St.  In  step  14,  an  equivalence  class  [k] t 
e S^,  where  dt(k)  = k,  was  chosen  as  a representative  of  the  collection 

of  all  equivalence  classes  [j]^  e S such  that  [j]t  ? S.  In  stops  15 

I 

through  17,  d^  was  calculated  so  that  for  each  j e V with  d t ( j ) = j, 

I 

such  that  [j]t  c S,  dt+^(j)  = k.  We  now  choose  the  vertex  k to  be  the 

representative  of  the  equivalence  class  S and  compute  d^+^  : V -*•  V 

so  that  d 1 ( i ) * k for  every  i e S. 

19.  For  each  i e V,  let  dt+1(i)  = dt+i (dt ( i ) ) - Note  that  if  dt(i)  = i, 
then  dt+1(i)  = dt+1(i). 

To  complete  the  algorithm,  there  must  be  a way  to  decide  for 
each  t J 0 whether  or  not  dt(i)  * dt(j)  ^or  i »J  e v-  This  can  be 

done  in  the  following  way.  For  i ■ 1,  . . . , n-1 , let 
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!0,  if  dt(i)  = dt(i+l) 

; 

1,  otherwise. 

Then  d^ ( i ) t d^ ( j ) for  some  i , j e V if  and  only  if  the  n-1  dimensional 
vector  [zt(l),  ....  zt(n-l)]  contains  a 1.  It  was  shown  in  Section  2.2 
that  this  can  be  decided  in  time  0(log(n-l))  with  n-1  processors. 

The  entire  algorithm  is  presented  below.  After  each  step, 
the  time  and  number  of  processors  used  to  perform  the  step  are  given. 
Following  the  algorithm,  there  is  an  analysis  of  the  time  and  processor 
complexities  of  the  algorithm  as  a whole. 

ALGORITHM  MST 

Let  G = (V,E)  be  a connected,  undirected,  weighted  graph, 
where  V = (1 , . . .,  n>.  Assume  that  the  weight  matrix, 

W,  of  G is  stored  in  the  memory.  Compute  the  weight 
matrix,  M,  of  a MST  of  G in  the  following  way. 

1.  For  each  pair  i , j e V,  let  M(i,j)  = 0.  (This  can  be  done  in  time 

p 

O(logn)  with  Tn  /lognl  processors.) 

2.  For  each  i e V,  let  dQ(i)  = i.  (constant  time,  n processors) 

3.  If  dQ(i)  = dQ(j)  f°r  all  i » j e V,  stop,  having  found  M.  Otherwise, 
go  on  to  step  4.  (By  the  remarks  preceding  the  algorithm,  this  can  be 
done  in  time  O(logn)  with  n-1  processors.) 

4.  For  each  i e V, 
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(a)  let  m^i)  = min  (W(i,j)  | j e V} 

(b)  let  C^i)  * min  (j  e V | W(i,j)  = m^ ( i ) } . (For  each 

i e V,  the  minimum  of  at  most  n numbers  must  be  found.  Thus,  by  the 
results  of  Section  2.2,  step  4 can  be  done  in  time  O(logn)  with 
nTn/lognl  processors.) 

5.  For  each  i e V,  let 


(min  {i,  C^i)),  if  (C^  ( i ) ) = i) 


c,m  -■ 


I ( i ) , otherwise. 


(constant  time,  n processors) 

I * I 

6.  For  each  i e V with  C^(i)  f i,  replace  M(i,C^(i))  and  M(Cj(i),i)  by 
W(i , C-j  ( i ) ) . (constant  time,  n processbrs) 

I 

7.  Let  k = 0.  For  i e V,  let  0Q(i)  = C^i).  (constant  time,  n processors) 

8.  If  k > log(n-l),  then  for  i e V,  let  d 1 ( i ) = Dk(i)  and  go  on  to  step 
10.  Otherwise,  increment  k by  1.  (constant  time,  n processors) 

9.  For  each  i e V,  let  Dk(i)  = Dk_-|  (Dk-i  ( i ) ) - Then  return  t0  steP  8. 
(constant  time,  n processors) 

10.  Let  t = 1.  (constant  time,  1 processor)' 

11.  If  dt(i)  « dt(j)  for  all  i ,j  e V,  stop,  having  found  M.  Otherwise, 
increment  t by  1.  (This  can  be  done  in  time  0(logn)  with  n-1  processors 
as  in  step  3.) 

For  each  i e V, 

(a)  let  mt+1(i)  = min  (W(i,j)  | j e V and  dt(j)  + dfc(i ) > 


12. 


(b)  let  at+1(i)  = min  (dt(j)  | j e V,  dt(j)  f dt(i)  and 
W(i  ,j)  = mt+1(i)) 

(c)  let  9t+1(i)  = min  {j  e V | dt(j)  = at+1(i)  and 

I 

W ( i , j ) = mt+^(i)}.  (This  can  be  done  in  time  O(logn)  with  nTn/lognl 

. 

processors,  as  in  step  4.) 

, 13.  For  each  i e V such  that  dt(i)  = i, 

I 

(a)  let  mt+1(i)  = min  (mt+1(j)  | j e V,  dt(j)  = i} 

(b)  let  Ct+1(i)  » min  (at+1(j)  | j e V,  dfc( j ) = i and 

mt+l(j)  = Vl(i)} 

I 

(r)  let  ht+1(i)  = min  {j  e V | dt(j)  = i,  mt+1(j)  = mt+1 ( i ) 
and  at+1(j)  = Ct+1(i)} 

I 

(d)  let  gt+|(i)  = 9t+1 (ht+7 (i ) ) . (This  can  be  done  in  time 
O(logn)  with  nTn/lognl  processors,  as  in  step  12.) 

14.  For  each  i e V with  d^(i)  = i,  let 

min  {i,  ct+1 ( i ) } , if  ct+1 (Ct+1 (i ) ) = i 

Ct+l(l)  = 

Ct+1(i),  otherwise. 

(constant  time,  n processors) 

I 

15.  For  each  i e V with  dt(i)  * i and  Ct+^(i)  ? i»  replace 
M(ht+1(i),  gt+1(i))  and  M(gt+1(i),  ht+1(i))  by  W(gt+1 (1 ) , ht+1(i)). 

r (constant  time,  n processors) 

16.  Let  k = 0.  For  each  i e V with  dt(i)  = i,  let  DQ^t+1^(i)  = Ct+^(i). 
(constant  time,  n processors) 


wmm 


- ” - • 
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I 

17.  If  k > log(n-l),  then  for  each  i e V with  dt(i)  = i,  let  dt+1(i)  = 
Dk^+^(i)  and  go  on  to  step  19.  Otherwise,  increment  k by  1.  (constant 
time,  n processors) 

18.  For  each  i e V with  d^ ( i ) = i,  let 

0k(t+1)(i)  = Dk_1(t+1)(Dk_1(t+1)(i)). 

Then  return  to  step  17.  (constant  time,  n processors) 

I 

19.  For  each  i e V,  let  d ^ ( i ) = dfc+-j  (dfc( i ) ) . Then  return  to  step  11. 
(constant  time,  n processors) 

Steps  8 and  9 of  ALGORITHM  MST  are  iterated  at  most 
F 1 og ( n-i ) 1 + 1 times.  Thus,  steps  1 through  10  can  be  performed  in  time 
0(logn)  with  nTn/lognl  processors.  Steps  11  through  19  of  ALGORITHM  MST 
are  iterated  at  most  Tlognl  times,  by  Lemma  2.  In  each  iteration,  steps 
17  and  18  are  performed  at  most  riog(n-l)]  + 1 times,  so  one  iteration  of 
steps  11  through  19  takes  time  at  most  0(logn)  and  nTn/lognl  processors. 
Thus  the  entire  algorithm  can  be  performed  in  time 

2 

0(logn)  + Tlognl  0(logn)  = 0(log  n) 
with  n[n/lognl  processors. 

The  best  serial  algorithms  for  finding  a minimum  spanning  tree 
o 

of  G have  time  complexity  0(n  ) [20],  using  the  adjacency  matrix  as  input, 

and  O(mloglogm)  [32],  where  m = |E|,  using  an  adjacency  list  representation 

2 2 

of  G.  Both  of  these  algorithms  have  time  complexity  0(n  ) if  m = 0(n  ). 

2 2 

Thus  ALGORITHM  MST  has  a speedup  of  0(n  /logn)  over  the  best  serial 
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algorithm  and  an  efficiency  of  0(l/logn).  This  indicates  that  there  is 

relatively  little  waste  incurred  by  using  nTn/lognl  processors  to  achieve 
2 2 

the  speedup  of  0(n  /log  n). 

3.3  Applications  of  the  MST  Algorithm 
Spanning  Trees 

Let  G = (V,E)  be  a connected,  undirected  graph,  where 
V = {1,  . . .,  n}.  Let  A be  the  adjacency  matrix  of  G.  Then  a spanning 
tree  of  G can  be  computed  from  A in  the  following  way. 

1.  For  each  pair  i , j e V,  let 

(-,  if  A(i,j)  = 0 

1 , otherwise 

2.  Execute  ALGORITHM  MST  using  W as  input.  The  nxn  matrix  M,  computed 
in  this  way,  is  the  adjacency  matrix  of  a spanning  tree  of  G. 

Since  step  1 can  be  done  in  time  O(logn)  with 

p 

Tn  /lognl  < p Tn/lognl  processors,  a spanning  tree  of  G can  be  found  in 
2 

time  0(log  n)  with  nTn/lognl  processors. 

In  the  algorithm  presented  in  Chapter  4 for  finding  the  bicon- 
nected  components  of  a connected,  undirected  graph  G,  the  first  step  is 
to  find  a spanning  tree,  S,  for  G.  For  this  problem,  the  adjacency  matrix 
of  S is  not  the  most  efficient  representation  of  S.  Let  V = {1,  . . .,  nl. 
Then  S can  be  represented  by  an  (n-1)  x 2 array  T where 
T(1  ,j ) g V and  the  set 
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{ (T ( i ,1)  *T ( 1 ,2))  | 1 - 1 n-U 

is  the  set  of  edges  of  S.  By  using  this  representation  of  S,  the  number 
of  processors  used  to  find  the  biconnected  components  of  G can  be  de- 
creased by  a factor  of  n. 

ALGORITHM  MST  can  be  modified  to  compute  an  n x 2 array  H where 
for  some  k e V,  H ( k , 1 )  I *  3 H(k,2)  = 0,  and  for  1 e V,  i + k,  and  j = 1,2, 

H(i  ,j ) e V and 

{ (H(i ,1 ) ,H(i ,2) ) | i c V,  i f k} 

is  the  set  of  edges  of  a spanntng  tree  of  G.  To  do  this,  at  the  beginning 
of  ALGORITHM  MST,  let  H( i ,j ) = 0 for  i e V and  j = 1 ,2.  Replace  step  15 

I 

by  1 5 below. 

I l 

15.  For  each  i e V with  dt(i)  3 i and  Ct+1(i)  t i» 
replace  H ( i , 1 ) by  ht+1(i)  and  H(i,2)  by  gt+1(i). 

If  for  some  i,  dt ( 1 ) = i and  C ( i ) f i,  then  dt+^(i)  f i.  Thus  for 
each  i,  there  is  at  most  one  edge  to  be  stored  as  (H(i  ,1 ) ,H(i  ,2) ) . 

Assume  now  that  the  n x 2 array  H has  been  computed  for  a span- 
ning tree  S of  G.  Compute  the  (n-1)  x 2 array  T in  the  following  way. 

1.  For  each  i e V,  let 

/o,  if  H ( i , 1 ) = H( 1 ,2)  = 0 
y(i)  -s 


i , otherwise. 


2.  Sort  the  numbers  y(l),  . . y(n)  in  decreasing  order  to  obtain  an 
array  z(  i ) , . . . , z(n). 

3.  For  i = 1,  . . .,  n-1 , let  T(i,l)  = H(z(i),l)  and  T(i,2)  * H(z(i),2). 
Then 

{ (T( i ,1 ) ,T(i  ,2))  | 1 - 1 n-1) 

is  the  set  of  edges  of  a spanning  tree  of  G. 

Steps  1 and  3 can  be  done  in  constant  time  with  n processors. 
Step  2 can  be  done  in  time  O(logn)  with  nTlognl  processors.  Thus  calcu- 
lation of  the  (n-1)  x 2 array  T can  be  incorporated  into  ALGORITHM  MST 
without  increasing  the  time  and  processor  complexity.  This  is  summarized 
in  the  theorem  below. 

Theorem  4.  Let  G = (V,E)  be  a connected,  undirected  graph,  where 
V = {1 , . . .,  n}.  There  is  an  algorithm  which  computes,  from  the  adja- 
cency matrix  of  G,  an  (n-1)  x 2 array  T,  where  T(i,j)  z V for  i = 1,  . . . 
n-1 , j = 1 ,2,  and 

{(T(i.l).T(i  ,2))  | i * 1 n-1) 

is  the  set  of  edges  of  a spanning  tree  for  G.  The  algorithm  has  time 
complexity  0(log  n)  and  processor  complexity  nTn/lognl. 

Connected  Components  and  Transitive  Closure 

Let  G * (V,E)  be  an  undirected  graph,  where  V = (1 , . . n). 

I I 

A spanning  forest  for  G is  a forest  S = (V,E  ) where  E * E and  every 


vertex  of  G is  incident  with  some  edge  in  E . ALGORITHM  MST 
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can  be  modified  to  find  the  adjacency  matrix  of  a spanning  forest 
o 

of  G in  time  0(1og  n)  with  nTn/logn]  processors  [26].  Under  this  modifi- 
★ 

cation,  if  t is  the  smallest  index  for  which  G^*  is  a spanning  forest  of 
G,  then  for  i,j  e V,  dt*(i)  » dt*(j)  if  and  only  if  i and  j are  in  the 
same  connected  component  of  G.  The  connected  components  of  G are  com- 
puted in  this  way  by  Hirschberg  in  [11]. 


Theorem  5.  Let  G = ( V , E)  be  an  undirected  graph,  where  V = (1,  . . .,  n}. 

There  is  an  algorithm  which  computes,  from  the  adjacency  matrix  of  G,  a 

lxn  array  D,  where,  for  i , j e V,  D(i)  = D(j)  if  and  only  if  i and  j are 

in  the  same  connected  component  of  G.  The  algorithm  has  time  complexity 
2 

0(log  n)  and  processor  complexity  nTn/logn]. 

Proof:  This  follows  by  combining  ALGORITHM  MIN(2)  with  the 
algorithm  presented  in  [11].  QED 

Corollary  1 . Let  G = (V,E)  be  an  undirected  graph,  where  V = (1,  . . .,  n}. 
Let  fg  = 1 , if  G is  connected,  0,  otherwise.  There  is  an  algorithm  which 
computes  fg,  from  the  adjacency  matrix  of  G,  in  time  0(log  n)  with 
nTn/lognl  processors. 

Proof:  Compute  the  lxn  array  D of  Theorem  5.  Then  G is  con- 
nected if  and  only  if  D(i)  * D(j)  for  all  i , j £ V.  It  was  shown  in  the 
discussion  preceding  ALGORITHM  MST  that  it  is  possible  to  decide  whether 
or  not  all  entries  of  a given  n dimensional  vector  are  the  same  in  time 
0(logn)  with  n-1  processors.  Thus,  If 

/l , if  D(i)  = D(j)  for  all  i , j e V 


0,  otherwise. 
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2 

then  fj,  can  be  computed  in  time  0(  1 og  n)  with  nTn/lognl  processors  from 
the  adjacency  matrix  of  6.  QEO 

Corollary  2.  Let  G = (V,E)  be  an  undirected  graph,  where  V = {1,  ....  n} 
and  let  A be  the  adjacency  matrix  of  G.  There  is  an  algorithm  which  com- 

p 

putes,  from  A,  the  transitive  closure  of  A in  time  0(log  n)  with 
nTn/lognl  processors. 

Proof:  First  compute,  from  A,  the  Ixn  array  0 of  Theorem  5. 

Then  for  each  i ,j  e V,  let 

!1,  if  D(1)  = D(j) 

; 

0,  otherwise. 

★ 

Then  A is  the  transitive  closure  of  A.  QEO 
3.4  Directed  Spanning  Trees 

For  a given  directed  graph  D = (V,E)  let  U(D)  be  the  correspond- 

I 

ing  undirected  graph,  U(D)  = (V,E  ),  where 

. — > — > 

E 3 ( ( i » j ) | (1,j)  e E or  (j,i)  e Eh 

Let  G = (V,E)  be  an  undirected  graph,  where  V = {1,  ....  n).  A 
directed  spanning  forest  of  G is  a directed  forest  T,  where  U(T)  is 
a spanning  forest  of  G.  If  G is  connected,  a directed  spanning  tree  of 
G is  a directed,  rooted  tree,  (T,r),  where  U(t)  is  a spanning  tree  of  G. 


Assume  6 is  connected.  The  algorithms  in  Chapters  6 and  7, 
for  finding  the  bridges  of  G and  a cycle  of  G,  begin  by  finding  a directed 
spanning  tree  of  G.  This  can  be  done  by  using  a modification  of  ALGORITHM 
MST  and  an  algorithm  for  finding  the  transitive  closure  of  a directed 
forest.  It  will  be  shown  in  Lemma  13  of  Chapter  6 that  the  transitive 
closure  of  a directed  forest  can  be  found  in  time  O(logn)  with  n processors. 

Theorem  6.  Let  ty^n)  and  pT(,(n)  be  the  time  and  processor  complexities, 
respectively,  of  an  algorithm  which  finds  the  transitive  closure  of  a 
directed  forest  with  n vertices.  Let  A be  the  adjacency  matrix  of  a 
connected,  undirected  graph  G = (V,E)  with  n vertices.  There  is  an 
algorithm  which  computes,  from  A,  a function  F : V -*■  V such  that  the  graph 

H * (V,  { ( i , F ( i ) ) | i e V,  i f F ( i ) > ) 

2 

is  a directed  spanning  tree  of  G,  in  time  0(log  n + logn*tyC(n))  with 
maxCnTn/lognl,  PyC(n)}  processors. 

Proof:  We  use  the  notation  of  Section  3.2  and  let  F^i)  = 

I 

C^(i)  for  all  i e V.  The  directed  graph 

i 

G1  - (V,  {(i iFj  (1 ) ) | 1 e V,  1 j»  F i ( i ) } ) 

is  a directed  spanning  forest  of  G^.  Assume  inductively  that  for  some 
t 2 1,  a function  F^iV+V  has  been  defined  so  that  the  graph 

G*  = (V,  {(i,FtO))  | i e V,  i f Ft(i )}) 

is  a directed  spanning  forest  of  G^.  Define  Ft+i^V,  so  that  the  graph 
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G^+1  = (V,  {(1,Ft+1(i))  | 1 e V,  i * Ft+1» 

is  a directed  spanning  forest  of  Gt+1 , in  the  following  way.  (See 
Figure  4. ) 

(a)  For  each  i e V with  dt(i)  = i and  Ct+1(i)  t i,  let  Ft+1 (ht+1 (i ) ) = 

W>- 

(b)  For  each  j t dt(j)  on  the  path  in  Gt  from  ht+1(dt(j))  to  dt(j), 
Ft+1(Ft(j))  = j. 

(c)  For  all  other  vertices,  F^ (i ) = Ffc(  1 ) . 

I 

To  implement  the  construction  of  Gt+1  from  Gt>  there  must  be 
a way  to  decide,  for  a given  j e V,  whether  j satisfies  the  conditions 
in  (b).  Let  be  the  adjacency  matrix  of  G^  and  let  be  the  transitive 

I 

closure  of  M^.  Then  j e V is  on  the  path  in  Gt  from  h^+^ ( dt ( j ) ) to 
dt(j)  if  and  only  if  Mt(ht+1 (dt(j)) ,j)  = 1 and  M*(j,dt(j))  = 1.  Thus, 
the  following  modifications  of  ALGORITHM  MST  will  compute  F^  and  will 

I 

compute,  for  t > 1,  Ft+^  from  F..  Replace  step  6 by  step  6 . 

6.  For  each  i e V,  let  F(i)  = C^(i).  Then  F = F^. 

For  each  i e V with  i f F(i ) , replace  M(i,F(i))  by  1. 

I 

Then  M is  the  adjacency  matrix  of  G^. 

i • 

Insert  step  6 j between  steps  6 and  7. 

1 * * 

6 j.  Find  M . Then  M is  the  transitive  closure  of 

I 

the  adjacency  matrix  of  G^. 
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Replace  step  15  by  step  15  . 

I t 

15.  (i)  For  each  i e V with  ( i ) = i and  Cfc+j(i)  t i» 

replace  F(ht+^(i))  by  g ( i ) . For  each  i e V with  i f 
dt(i),  such  that  M (h^+^ (d^(i ))  ,i ) = 1 and  M ( i ,dfc ( i ) ) = 1, 
replace  F(j)  by  i,  where  j = F(i).  Then  Ft+1  " F' 

(ii)  For  each  i e V,  with  i t F(i),  replace  M(i,F(i)) 

I 

by  1.  Then  M is  the  adjacency  matrix  of  Gt+1. 

■j  • 

Insert  step  15  £ between  steps  15  and  16. 

1 * * 

15  j.  Find  M . Then  M is  the  transitive  closure  of 

I 

the  adjacency  matrix  of  Gt+1* 

When  the  modified  algorithm  halts,  the  function  F is  such  that 

the  graph 

6^*  = (V,  {(1  ,F(1 ))  | ieV.iji  F(1)» 

I 

is  a directed  spanning  tree  of  G.  The  root  of  G^*  is  the  vertex  r e V 
for  which  F(r)  = r.  The  nxn  matrix  M stored  in  the  memory  is  the  weight 

I 

matrix  of 

i 

Step  15  can  be  done  in  constant  time  with  n processors  and 

step  15  ^ can  be  done  in  time  tj^(n)  with  PyC (n)  processors.  Thus  in  the 

modified  algorithn,  one  iteration  of  steps  11  through  19  requires  time 

O(logn)  + tyC(n)  and  max  (nTn/lognl,  py^n)}  processors.  Since  steps  11 

through  19  are  iterated  at  most  Tlognl  times,  and  since  the  rest  of  the 

algorithm  can  be  done  in  time  0(1 og  n)  with  Tn  /lognl  processors,  the  en- 

o 

tire  modified  algorithm  can  be  performed  in  time  0(log  n + logn  • tTc(n)) 
with  max  {nfn/lognl,  Py^fn)}  processors.  QED 
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Chapter  4 

B I CONNECTED  COMPONENTS 

Let  G = (V,E)  be  an  undirected  graph.  A vertex  a e V is  an 
articulation  point  of  G if  there  are  vertices  u,  v e V,  distinct  from  a, 
such  that  every  path  in  G joining  u and  v contains  a.  A graph  is  bicon- 
nected  if  it  is  connected  and  contains  no  articulation  points.  A 
bi connected  component  of  G is  a subgraph  H of  G with  the  property  that 
H is  bi connected  and  if  K is  any  bi connected  subgraph  of  G such  that 
H . K then  H = K.  If  V*  _ V,  let 

E (V  ) = {(x,y)  e E | x e V and  y £ V }. 

I I I 

Note  that  if  B = (V  ,E  ) is  a biconnected  component  of  G,  then  e = 

I I I 

E ( V ).  Otherwise,  there  would  be  an  edge  e e E(V  ) - E and  then  K = 

I I 

(V  ,E  U (e))  would  be  a biconnected  subgraph  of  G such  that  B £ K, 
but  B f K. 

In  this  chapter,  an  algorithm  will  be  presented  which  finds 

the  biconnected  components  of  a connected,  undirected  graph  G in  time 
2 2 

0(log  n)  with  n Tn/lognl  processors.  The  algorithm  will  find  the  sets 
of  vertices  of  the  biconnected  components  of  G,  since  the  set  of  vertices 
of  a biconnected  component  determine  it  completely.  The  following 
lemmas  concerning  biconnected  components  will  be  needed. 

Lemma  5.  If  B^  = (V^E^)  and  B2  * (V2,E2)  are  two  distinct  biconnected 


components  of  G then  V1  V2  contains  at  most  one  vertex. 
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Proof:  See  [1],  p.  181.  QED 

Lemma  6.  Vertices  i and  j of  a graph  G are  in  a common  biconnected 
component  of  G if  and  only  if  i = j,  (i,j)  is  an  edge  of  G,  or  i and  j 
lie  on  a common  cycle  of  G. 

Proof:  See  [9],  p.  27.  QED 

Let  G * (V,E)  be  a connected,  undirected  graph.  The  biconnected 
components  of  G will  be  found  in  the  following  way.  First,  a spanning 
tree,  S,  of  G is  found.  It  will  be  shown,  in  Lemma  8,  that  each  edge  of 
S determines  a biconnected  component  of  G and,  in  Lemma  9,  that  every 
biconnected  component  of  G contains  an  edge  of  S.  Next,  a relation  R 
is  defined  on  V,  and  computed  by  the  algorithm,  so  that  iRj  if  and  only 
if  i and  j are  in  a common  biconnected  component  of  G.  It  is  shown  in 
Lenina  8 that  if  (i,j)  is  an  edge  of  S,  and 

V = {k  e V | kRi  and  kRj} 

I I 

then  B = (V  ,E(V  ))  is  the  biconnected  component  of  G containing  (i,j). 

In  this  way,  the  biconnected  component  of  G determined  by  (i,j)  is 
computed. 

Let  R be  the  relation  on  V which  is  defined  above.  Note  that 
R is  reflexive  and  symmetric,  but  not  necessarily  transitive.  For 

each  i e V,  construct  a graph  G^  = (V.E^,  where  Ei  consists  of  all 

* / 

edges  in  E except  those  incident  with  vertex  i.  Let  Gi  = (V.E^  be 
the  transitive  closure  of  G^ . Then  the  following  theorem  holds. 
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Theorem  7.  Let  G * (V,E)  be  a connected,  undirected  graph.  For  i ,j  e V, 

★ 

i R j if  and  only  if  (i,j)  e E^  for  every  k e V,  distinct  from  i and  j. 

(Note  in  particular  that  i Ri  for  all  i e V and  iRj  for  all 
(i,j)  e E.  The  proof  of  this  theorem  relies  on  the  following  lemma  and 
its  corollary.) 

Lenina  7.  Let  G = (V,E)  be  a connected,  undirected  graph.  For  distinct 
vertices  x,  y,  and  z in  V,  if  xRy  and  yRz  and  xRz  then  x,  y and  z are 
contained  in  a common  biconnected  component  of  G. 

Proof:  If  (x,y)  e E,  let  be  the  graph  C1  = ({x,y},  ( ( x ,y ) } ) . 
Otherwise,  let  be  a cycle  of  G containing  x and  y.  Similarly,  define 
and  as  graphs  consisting  of  either  a single  edge,  (y,z)  or  (x,z), 
respectively,  or  a cycle  of  G containing  y and  z,  or  x and  z,  respectively. 
For  i = 1,2,3,  the  graphs  Ci  exist,  by  Lemma  6.  Note  that  for  i = 1,2,  or 
3,  if  u and  v are  vertices  of  C. , and  k is  a vertex  of  V,  distinct  from 
u and  v,  there  is  a path  in  joining  u and  v which  does  not  contain  k. 

Claim:  The  graph  B = is  biconnected.  To  see  this,  let  u and  v 

be  distinct  vertices  of  B,  such  that  u is  a vertex  of  Ci  and  v is  a 

vertex  of  C.  for  some  i,j  e {1,2,3}.  Since  C./lC.  f <p,  let  w e C.AC..  Let 
J ' J 1 J 

k be  a vertex  of  B,  distinct  from  u and  v.  If  w = u or  w = v,  then  u and 

v are  both  in  C.  or  C.  and  there  is  a path  joining  u and  v in  C,  or  C. 

' J * J 

(and  thus  in  B)  which  does  not  contain  k.  If  w is  distinct  from  u and  v,  let 

p^  be  the  path  of  C^,  joining  u and  w,  which  does  not  contain  k,  and  Pj,  the 

path  of  £.,  joining  w and  v,  which  does  not  contain  k.  Then  p^vpi  is 
J J 
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-.1 


I 


a path  in  B joining  u and  v which  does  not  contain  k.  Thus,  B is 
biconnected  and  is  therefore  contained  in  some  biconnected  component, 

I 

B , of  G.  Since  x,  y,  and  z are  vertices  of  B,  they  all  lie  in  the 

I 

biconnected  component  B . QED 

Corollary.  Let  G * (V,E)  be  a connected,  undirected  graph.  For  vertices 
u,  v,  x,  and  y of  V,  such  that  u f v,  assume  uRv  and  xRu,  yRu,  xRv,  and 
yRv.  Then  u,  v,  x,  and  y are  contained  in  a common  biconnected  component 
of  G. 

Proof:  By  Lemma  7,  u,  v,  and  x are  contained  in  a biconnected 
component,  B-j  = (V^,E(V^)),  of  G and  u,  v,  and  y are  contained  in  a bicon- 
nected component,  B£  = (l^.EfVg)).  of  G.  Since  = {u,v}  and  u f v, 

by  Lemma  5,  and  so  B-|  = Thus,  u,  v,  x and  y are  vertices  of 

Br  QED 

Proof  of  Theorem  7:  If  i = j,  the  theorem  is  true.  Assume 

i ^ j.  If  iRj,  let  B = (Vg,E(Vg))  be  the  biconnected  component  of  G 

containing  i and  j.  Let  k be  a vertex  of  V,  distinct  from  i and  j. 

Since  B is  biconnected,  B is  connected  and  there  is  a path  in  B joining 

i and  j.  If  k t Vg  this  path  cannot  contain  k.  If  k e Vg,  k is  not 

an  articulation  point  of  B and  thus  there  is  a path  in  B joining  i and  j 

★ 

which  does  not  contain  k.  Thus  ( i , j ) e E^. 

★ 

Conversely,  assume  (i,j)  e Ek  for  all  k e V,  distinct  from  i 
and  j.  Let  B = (Vg,Eg)  be  a biconnected  component  of  G containing  i. 
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Let 

I 

B = BU{p  | p is  a simple  path  in  G joining  i and  j}. 

I I 

Let  x and  y be  distinct  vertices  of  B . Let  k be  any  vertex  of  B , dis- 
tinct from  x and  y.  If  x,  y e Vg,  there  is  a path  in  B joining  x and  y 
which  does  not  contain  k.  If  neither  x nor  y is  in  Vg,  assume  x lies  on 

the  simple  path  p and  y,  on  the  simple  path  p , joining  i and  j.  There 
x y 

is  a path  in  p^,  joining  either  x and  i or  x and  j,  which  does  not  contain 

k.  Similarly,  there  is  a path  in  p^,  joining  either  y and  i or  y and  j, 

★ 

which  does  not  contain  k.  Since  (i,j)  e E^,  there  is  a simple  path  in 

I 

B , joining  i and  j,  which  does  not  contain  k.  These  paths  can  be  com- 

I 

bined  to  produce  a path  in  B joining  x and  y which  does  not  contain  k. 

A similar  argument  shows  that  if  x e Vg,  but  y t Vg,  there  is  a path  in 

I I 

B joining  x and  y which  does  not  contain  k.  Thus  B is  a bi connected 

I 

subgraph  of  G containing  B.  Then  B = B and,  in  particular,  j e B and 
i R j . QED 

Lemma  8.  Let  S be  a spanning  tree  of  the  connected,  undirected  graph 

G = (V,E)  and  let  e = (x,y)  be  an  edge  of  S.  Then  e is  in  some  biconnected 

component  of  G.  Further,  if 

V - {z  e V | zRx  and  zRy) 

I I I 

then  B = (V  ,E(V  ))  is  the  biconnected  component  of  G containing  e. 

Proof:  By  Theorem  7,  since  e * (x,y)  e E,  xRy,  that  is,  x and  y 
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are  contained  in  a common  biconnected  component,  B = (W,E(W)),  of  G. 

Since  x,y  e W,  e = (x,y)  e E(W). 

For  the  second  part  of  the  theorem,  note  that  since  xRx,  yRy, 

I I I 

and  xRy,  x and  y are  in  V and  thus  (x,y)  e E(V  ).  To  see  that  B = 

I I I 

(V  ,E(V  ))  is  biconnected,  let  u,  v e V . By  the  corollary  to  Lemma  7, 

"tc  * 

uRv,  thus,  by  Theorem  7,  (u,v)  e E^  for  all  vertices  k of  V , distinct 

I I 

from  u and  v.  Thus  B has  no  articulation  points.  To  see  that  B is  a 

II  I I II 

biconnected  component,  let  B = (V  ,E(V  ))  be  a biconnected  component 

I II  III 

of  G containing  B . Let  w be  an  element  of  V . Then  since  x,y  e V 5-  V , 

l I II  I II 

wRx  and  wRy.  Thus  w e V , so  V = V and  B = B . QED 

Lemma  9.  Let  S be  a spanning  tree  of  the  connected,  undirected  graph 

G = (V,E).  Then  every  biconnected  component  of  G contains  an  edge  of  S. 

I I I 

Proof:  Let  B = (V  ,E(V  ))  be  a biconnected  component  of  G. 

If  V has  only  one  vertex,  S contains  no  edges  and  the  lemma  is  true. 

I I 

Otherwise,  since  G is  connected,  V has  more  than  one  vertex  and  E(V  ) is 

I 

nonempty.  If  E(V  ) contains  no  edge  of  S,  let  e = (u,v)  be  an  edge  of 

I 

E(V  ).  Then  adding  edge  e to  S creates  a cycle,  C,  in  S which  contains 
e.  Let  (x,y)  be  an  edge  of  C,  distinct  from  (u,v).  By  Lemma  6,  xRu, 
yRu,  xRv,  and  yRv.  Thus,  since  uRv,  it  follows  from  the  corollary  to 
Lemma  7 that  u,  v,  x,  and  y are  contained  in  a common  biconnected  com- 
ponent, b'  = (v"  ,E  ( V ' ' ) ) of  G.  Since  {u,v}  ?v'rt  v",  b"  = B ' , by 

I I 

Leimia  5.  Thus,  x and  y are  in  V and  (x,y)  e E (V  ),  which  contradicts 

I 

the  assumption  that  E(V  ) contains  no  edge  of  S.  QED 
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Lemmas  8 and  9 verify  that  the  bi connected  components  of  G 

can  be  computed  by  the  method  described  at  the  beginning  of  this  chapter. 

For  each  biconnected  component,  B = (VD,E(VD)),  of  G,  an  n-dimensional 
_>  “ “ — > 

vector  B will  be  computed  so  that  for  each  j e V,  B (j)  = 1 if  and 

only  if  j e Vg.  Before  the  algorithm  is  presented,  one  other  problem 

I 

must  be  handled.  If  S is  a spanning  tree  of  G and  e and  e are  edges 

I 

of  S,  e and  e may  be  in  the  same  biconnected  component  of  G.  If,  for 
every  edge  in  S,  the  biconnected  component  associated  with  that  edge  is 
computed,  some  biconnected  component  of  G will  be  computed  more  than 
once.  This  duplication  can  be  eliminated  in  the  following  way. 

Let  T be  an  (n-l)x2  array,  where  the  set 

{(T(i.,l),T(i,2))  | 1 » 1,  ....  n-1  } 

is  the  set  of  edges  of  a spanning  tree,  S,  of  G.  For  i ,j  e {1,  . . ., 
n-1},  the  edges  (T(i  ,1)  ,T(i  ,2))  and  (T(j ,1 ) ,T(j  ,2) ) are  in  the  same  bi- 
connected component  of  G if  and  only  if  T(i,l)  R T(j,l),  T ( i , 2 ) R T(j,2), 
T(i,l)  R T(j,2),  and  T ( i ,2 ) R T(j,l).  The  necessity  of  this  condition  is 
a consequence  of  the  definition  of  R and  the  sufficiency  follows  from 
the  corollary  to  Lemma  7.  For  i = 1,  ....  n-1,  let  m(i)  be  the  smallest 
index  j for  which  the  edge  ( T ( i , 1 ) , T(i,2))  is  in  the  same  biconnected 
component  of  G as  the  edge  (T( j ,1 ) ,T(j  ,2) ) . Then  the  set  of  edges, 

Q = {(T(i  ,1 ) ,T(i  ,2) ) | i = 1,  ....  n-1  and  m(i)  = 1), 

has  the  property  that  each  biconnected  component  of  G contains  exactly 
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i 


i 


one  edge  of  Q.  From  the  function  m,  the  number,  K,  of  connected  com- 
ponents of  6 can  be  computed  and  a Kxn  array,  B,  can  be  constructed  where 

— > — > 

B ( i » j ) = Bi  (j)  and  the  n-dimensional  vectors,  8^,  . . . , B^,  represent 

the  biconnected  components  of  G. 

For  i = 1 , . . . , n-1 , let 

!i , if  m(i ) = i 

0,  otherwise. 

Let  z(l),  . . .,  z(n-l)  be  the  elements  y(l),  . . y(n-l)  in  decreasing 
order.  The  largest  index  k for  which  z(k)  f 0 is  K,  the  number  of  bicon- 
nected components  of  G and 

Q = {(T(z(i),l),T(z(i),2))  | i - 1.  ....  K}. 


For  i = 1 , . . . , K and  j e V,  let 


B.  (j ) 


1,  if  j RT ( z ( i ) ,1 ) and  jRT(z(k),2) 

I 

10,  otherwise. 


■> 


— > 


Then  the  vectors,  B^ , . . represent  the  biconnected  components 
G.  The  entire  algorithm  is  presented  below,  followed  by  an  analysis 
the  time  and  processor  complexity. 


of 

of 


i 


i 
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ALGORITHM  BICONNECTED  COMPONENTS 


Let  A be  the  adjacency  matrix  of  a connected,  undirected 
graph  G = (V,E),  where  V = (1 , . . . , n).  Find  the  bicon- 
nected  components  of  G in  the  following  way. 

Compute  an  (n-1)  x 2 array  T so  that  the  set 


{(T(i  ,1 ) ,T(i  ,2))  | 1 - 1,  . . n-1} 


is  the  set  of  edges  of  a spanning  tree  of  G. 

2.  For  each  k e V,  construct  an  nxn  matrix  M^,  where  for  i,j  e V, 


Mk(i,j)  = 


0,  if  i = k or  j = k 


A(i  ,j) , otherwise. 


(Then  Mfc  is  the  adjacency  matrix  of  the  graph  G^,  obtained  from  G by 

removing  every  edge  incident  with  i.) 

★ 

3.  For  each  k e V,  compute  M^,  the  transitive  closure  of 

4.  For  each  pair  i , j e V,  let 

R(i,j)  = min  (M*(i,j)  | k e V,  k f i,j}. 

(Then  R(i,j)  = 1 if  and  only  if  i Rj . ) 

5.  For  i * 1 , . . . , n-1 , let 


lL 


m(1)  » min  (j  | j*l , . . .,  n-1  and  R(T(i  ,1 ) ,T( j ,1 ) ) 

* R(T(i  ,2)  ,T( j ,2) ) = R(T(i  ,1 ) ,T(j  ,2) ) 

* R(T(1,2),T(j,l))  * 1). 
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(Then  for  i = 1,  . . . , n-1 , m(i ) = m(j ) if  and  only  if  the  edges 

(T(i ,1 ) ,T(i ,2))  and  (T(j ,1 ) ,T(j ,2) ) are  in  a common  biconnected  component 

of  6.) 

6.  For  i = 1 , . . . , n-1 , let 

fo,  if  m(i)  i i 

y(i) 

li , otherwise. 

7.  Sort  the  array  y ( 1 ) , . . . , y(n-l)  in  decreasing  order  to  obtain  a 
sorted  array  z(l),  . . . , z(n-l). 

8.  Let  K = max  {i  | i=l,  . . n-1  and  z(i)  t 0}.  (Then  K is  the  number 

of  biconnected  components  of  G and  each  biconnected  component  of  G con- 
tains an  edge  (T(j  ,1 ) ,T(j ,2) ) where  j e (1 , . . . , K>.) 

9.  For  i = 1 K and  j e V,  let 


B.(j) 


1,  if  R(j  J(z(i),l))  = R( j ,T(z(i ) ,2) ) = 1 

0,  otherwise. 


— > 


(Then  the  Ixn  arrays  B-j , . . 

■ ■ i 

B-j , . . . , BK  of  G,  where  Bi 


By  represent  the  biconnected  components 
= (VpE(v’))  and  \l\  = {j  c V | B.(j)  = 1}.) 


Notice  that  G is  biconnected  if  and  only  if  K = 1.  Thus,  this 
algorithm  also  determines  whether  a given  connected,  undirected  graph  is 
biconnected. 
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Analysis  of  Time  and  Processor  Requirements 

o 

1.  By  Theorem  4 of  Section  3.3,  step  1 can  be  done  in  time  0(log  n)  with 
nTn/lognl  processors. 

2.  Partition  the  set  {(i,j,k)  | i,j,k  e V}  into  Fn  /lognl  sets  of  at 
most  Tlognl  numbers  each.  Assign  one  processor  to  each  set  X to  compute 
Mk(i,j)  for  each  of  the  triples  (i,j,k)  e X.  Then  step  2 can  be  done  in 

3 

time  O(logn)  with  Tn  /lognl  processors. 

3.  By  Corollary  2 of  Theorem  5,  the  transitive  closure  of  the  adjacency 

2 

matrix  of  an  undirected  graph  can  be  found  in  time  0(log  n)  with 

2 

nTn/lognl  processors.  Thus  step  3 can  be  done  in  time  0(log  n)  with 
2 

n Tn/lognl  processors. 

4.  In  step  4,  the  minimum  of  at  most  n numbers  must  be  found  for  each 

2 

pair  (i,j)  e V x V.  This  can  be  done  in  time  O(logn)  with  n Fn/lognl 
processor. 

5.  In  step  5,  for  each  i e V,  the  minimum  of  at  most  n numbers  must  be 
found.  This  can  be  done  in  time  O(logn)  with  nTn/lognl  processors. 

6.  Step  6 can  be  done  in  constant  time  with  n processors. 

7.  As  discussed  in  Section  2.2,  n-1  numbers  can  be  sorted  in  time 
0(log(n-l))  with  (n-1)  riog(n-l)!  processors. 

8.  In  step  8,  the  maximum  of  at  most  n numbers  can  be  found  in  time 
O(logn)  with  nTn/lognl  processors. 

9.  Step  9 can  be  done  in  constant  time  with  K • n * n processors. 

Thus,  the  total  time  used  by  ALGORITHM  BICONNECTED  COMPONENTS 

2 2 
Is  0(log  n)  and  the  maximum  number  of  processors  used  is  n Tn/lognl,  in 

step  3. 


n 

i! 


The  fastest  serial  algorithm  for  finding  the  bi connected 
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components  of  a graph  6,  with  n vertices  and  m edges,  uses  an  adjacency 

2 

list  representation  of  G and  has  time  complexity  0(m)[l].  If  m = 0(n  ), 

2 

this  algorithm  has  time  complexity  0(n  ).  If  the  adjacency  matrix  of  G 

is  used  as  input,  it  can  be  shown  (see  Theorem  13)  that,  for  some  con- 

2 

stant  c,  at  least  cn  entries  of  the  adjacency  matrix  of  G must  be 

examined  to  determine  whether  or  not  G is  biconnected.  Thus  any  serial 

algorithm  which  determines  bi connectivity  from  the  adjacency  matrix  must 

have  time  complexity  at  least  cn2.  Since  ALGORITHM  BICONNECTED  COMPONENTS 

2 3 

has  time  complexity  0(1 og  n)  and  processor  complexity  0 ( n /logn),  a 
2 2 

speedup  of  0(n  /log  n)  is  achieved  over  the  best  serial  algorithm,  with 
an  efficiency  of  0(l/nlogn).  Although  the  speedup  is  large,  the  efficiency 
indicates  that  a relatively  large  amount  of  waste  is  incurred  by  using 
0(n  /logn)  processors  to  achieve  the  speedup. 


1 


i 
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Chapter  5 
DOMINATORS 

A rooted  graph,  (G,r),  is  a directed  graph  G = (V,E)  with  a 

distinguished  vertex  r such  that  there  is  a path  in  G from  r to  i for 

every  i e V.  For  vertices  i and  j of  V,  i is  a dominator  of  j in 

(G,r)  if  every  path  from  r to  j in  G contains  i.  In  particular,  for 

every  i in  V,  r and  i are  dominators  of  i.  For  vertices  i,j  and  k of 

V,  whenever  i is  a dominator  of  j and  j is  a dominator  of  k,  then  i is 

a dominator  of  k and,  if  k * i,  then  i = j.  Thus,  the  relation  R on  V, 

defined  for  i and  j in  V by  iRj  if  and  only  if  i is  a dominator  of  j, 

is  a partial  order  of  V.  For  each  vertex  j of  V,  let  = {i  e V | iRj}. 

Then  R,  restricted  to  the  set  S.,  is  a total  order  of  S..  For  each 

J J 

vertex  j of  V,  distinct  from  r,  the  vertex  maxR  {i  e | i f j}  is 
called  the  immediate  dominator  of  j in  (G,r).  Since  R is  antisymmetric, 
the  immediate  dominator  of  j is  unique.  The  dominator  relation  R on  V 
can  be  represented  by  a directed,  rooted  tree,  (TD,r)  with  vertex  set 
V in  which  the  father  of  each  vertex  j,  distinct  from  r,  is  the  immedi- 
ate dominator  of  j in  (G,r).  Then  i is  a dominator  of  j in  (G,r)  if 

and  only  if  i is  an  ancestor  of  j in  T^.  T^  is  called  the  dominator 

tree  of  G. 

In  this  chapter,  a parallel  algorithm  is  presented  which  finds, 
for  a directed,  rooted  graph  (G,r),  with  |V|  = n,  an  nxn  array  D, 
where,  for  i,j  e V, 
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, if  i is  a dominator  of  j in  '(G,r) 

i 

0,  otherwise. 

D will  be  called  the  dominator  matrix  of  (G,r).  In  addition,  the  algorithm 
computes  a function  F : V ■+  V,  where  the  directed  graph, 

Td  » (V,  {(1  »F(i ))  | i e V,  i + F ( i ) } ) , 

is  the  dominator  tree  of  (G,r).  The  time  complexity  of  the  algorithm  is 

O(log^n)  and  the  processor  complexity  is  (logn)  n^+log‘7. 

Let  (G,r)  be  a rooted  graph  where  G = (V,E)  and 

V * {1,  . . .,n}.  The  dominator  relationship  on  V can  be  computed  in 

the  following  way.  For  each  vertex  k of  V,  distinct  from  r,  let  G^ 

be  the  directed  graph  obtained  from  G by  removing  all  edges  which  leave 

★ 

the  vertex  k.  For  each  vertex  k of  V,  let  G^  be  the  transitive  closure 
of  the  graph  Gk> 

Lenina  10.  For  i,  j e V,  iRj  if  and  only  if  i = r,  i = j,  or  (r,j)  t G^. 

Proof:  If  iRj,  i f j,  and  i t r,  then  every  path  in  G from 

r to  j contains  i.  Thus,  if  all  edges  leaving  i are  removed  from  G, 

★ 

there  Is  no  pathinG  from  r to  j,  that  is,  (r,j)  t G^.  Conversely, 

★ 

if  i = r or  i * j then  iRj.  If  (r , j ) t G.. , there  is  no  path  from  r to  j 

in  Gi . Thus,  any  path  in  G from  r to  j must  include  i,  so  iRj.  QED 

Thus,  the  following  steps  compute  the  dominator  matrix,  D,  of 
(G,r)  from  r and  the  adjacency  matrix,  A,  of  G. 


1.  For  each  k e V with  k t r,  and  each  pair  i ,j  e V,  let 


r0  if  i = k 


U(i  ,j) , otherwise. 

Then  is  the  adjacency  matrix  of  G^. 

★ 

2.  For  each  k e V with  k / r,  find  M^,  the  transitive  closure  of  M^. 

3.  For  each  i ,j  e V , let 

M,ifi  = rori=j 
D(i  ,j ) * 1 , if  i t r and  M.(r,j)  = 0 
0,  otherwise. 

To  compute  the  dominator  tree  of  (G,r)  let  F(j)  be  the  immediate 
dominator  of  j in  (G,r)  for  each  vertex  j of  V,  distinct  from  r,  and  let 
F(r)  ■ r.  If  the  nxn  array,  D,  has  been  calculated  then,  given  u,  v e S., 

J 

a processor  can  calculate  maxR  {u,v}  in  constant  time,  since 


maxR  {u,v}  * 


v if  R(u,v)  = 1 


u,  otherwise. 


Thus,  ALGORITHM  MIN  (2)  of  Section  2.2  can  be  used  to  find  F(j).  The 
entire  algorithm  is  summarized  and  analyzed  below. 


I 


■ “ 
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ALGORITHM  DOMINATORS 

Let  (G,r)  be  a rooted  graph,  where  G = (V,E)  and 
V = 0 , . . n}.  Given  r and  the  adjacency  matrix.  A, 

of  G,  find  the  dominator  matrix,  D,  of  (G,r)  and  for  each 
r e V,  with  i t r,  find  F(i),  the  immediate  dominator  of  i 
in  (G,r)  in  the  following  way. 

1.  For  each  k e V,  with  k j*  r,  and  each  pair  i ,j  e V,  let 

io,  if  i = k 

\ 

A(i  ,j) , otherwise. 

(constant  time,  n processors) 

★ 

2.  For  each  k e V,  with  k r,  find  M^.  (It  was  mentioned  in  Section 
2.2  that  the  transitive  closure  of  a directed  graph  can  be  found  in  time 
O(log^n)  with  (1ogn)n^og^  processors.) 

3.  For  each  pair  i , j e V,  let 

il , if  i = r or  i = j 
1 , If  i f r and  M*(r,j)  * 0 
0,  otherwise. 

2 

(constant  time,  n processors) 

4.  For  each  j e V with  j f r,  let 

F(j ) = maxR  (i  e V | i t j and  D(i,j)  = 1). 


Let  F(r)  = r.  (This  can  be  done  using  ALGORITHM  MIN  (2)  of  Section  2.2 
in  time  O(logn)  with  nTn/lognl  processors.) 

2 

ALGORITHM  DOMINATORS  has  time  complexity  0(log  n)  and  processor 
complexity  (logn)n1+log7.  The  best  serial  algorithm  for  this  problem 
uses  the  adjacency  list  representation  of  G and  has  time  complexity 

p 

0(nlogn  + in),  where  m is  the  number  of  edges  of  G [29],  If  m = 0(n  ), 

2 

this  algorithm  has  time  complexity  0(n  ).  It  can  be  shown  (see  Theorem 

14)  that  any  serial  algorithm  which  computes  the  dominators  of  G,  from 

2 

the  adjacency  matrix  of  G,  has  time  complexity  at  least  cn  , for  some 

constant  c.  Comparing  the  parallel  and  serial  algorithms,  the  parallel 

2 2 

algorithm  achieves  a speedup  of  0(n  /log  n)  with  an  efficiency  of 
0(1/n1o97_1log3n).  Although  the  speedup  is  large,  there  is  much  waste 
in  using  so  many  processors  to  achieve  it. 


Chapter  6 


BRIDGES 

Let  G = (V,E)  be  a connected,  undirected  graph  with  n vertices. 

I 

An  edge  e of  E is  a bridge  of  G if  the  graph  G = (V,E-{e})  is  not  con- 
nected. A graph  is  bridge  connected  if  it  is  connected  and  contains  no 
bridges.  A bridge  connected  component  of  G is  a maximal  bridge  connected 
subgraph  of  G. 

In  Section  6.1,  a straightforward  algorithm  is  presented  which 

2 2 

finds  the  bridges  of  G in  time  0(log  n)  with  n Tn/lognl  processors.  In 
Section  6.3,  a more  involved  algorithm  is  presented  to  find  the  bridges 
of  G in  time  0(log  n)  using  only  n Tlognl  processors.  It  is  shown  in 
Section  6.4  that  the  bridge  connected  components  of  G can  also  be  found 
in  time  0(log  n)  with  n Tlognl  processors.  Both  algorithms  for  finding 
the  bridges  of  a graph  are  based  on  the  following  lemma.  (See,  for 
example,  [30].) 

Lemma  11.  Let  G = (V,E)  be  a connected,  undirected  graph.  If  e e E is 
a bridge  of  G,  then  e is  contained  in  every  spanning  tree  of  G. 

Proof:  Let  S be  the  set  of  edges  of  a spanning  tree  for  G. 

If  e t S then  removal  of  e from  G does  not  disconnect  G.  Thus  e is  not 
a bridge  of  G.  QED 

6.1  First  Bridge  Algorithm 

Let  G = (V,E)  be  a connected,  undirected  graph,  where 

I 

V = {1,  . . . , n}.  Let  S * (V,E  ) be  a spanning  tree  of  G,  where 


t 
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E = (e,  . . • , en  ^ By  Lemma  11,  if  e is  a bridge  of  G,  then  e e E . 

I 

Thus,  the  bridges  of  G can  be  found  by  testing  each  edge  of  E to  deter- 
mine whether  or  not  it  is  a bridge  of  G.  For  i = 1,  . . .,  n-1 , let  G^ 
be  the  graph  obtained  from  G by  removing  edge  e^  Then  ei  is  a bridge 

I 

of  G if  and  only  if  Gi  is  not  connected.  Let  the  function  f : E -*-{0,1} 

f 

be  defined  for  e.  e E by 

!1 , if  ei  is  a bridge  of  G 

i 

0,  otherwise. 

Then  f can  be  computed  from  the  adjacency  matrix,  A,  of  G in  the  following 
way. 

1.  Compute  an  (n-1)  x 2 array  T so  that  the  set 

E*  = ((T(i  ,1)  ,T(i  ,2))  ( i = 1 n-1} 

is  the  set  of  edges  of  a spanning  tree,  S,  of  G. 

2.  For  k = 1,  . . . , n-1  and  for  each  pair  i , j e V,  let 

Mk(i,j)  = 

Then  is  the  adjacency  matrix  of  the  graph  G^ 

3.  For  1=1,.  . . , n-1 , let 


0,  if  (i,j)  = (T(k,l),T(k,2)) 

f 

A(i  ,j) , otherwise. 


- 


f(1)  - 

Then  f(1)  ■ 1 If  and  only  If  (T(1 ,1 ) ,T( 1 ,2) ) Is  a bridge  of  G. 

Let  K be  the  number  of  bridges  of  G.  Once  f has  been  computed, 
the  bridges  of  G can  be  stored  In  a K x 2 array,  B,  so  that  the  set 

{(8(1 ,1),B(1 ,2))  | 1 - 1 Kl 

Is  the  set  of  bridges  of  G.  The  following  steps  compute  K and  the  K x 2 
array,  B. 

4.  For  i » 1 , , . . , n-1 , let 

y(D  - 

5.  Sort  the  numbers  y(l),  . . .,  y(n-l)  In  decreasing  order  to  obtain 

the  array  z(l) z(n-l). 

6.  Let  m ■ min  (1  1 1 ■ 1,  . . . , n-1 , z{ 1 ) ■ 01 . Let  K • m-1.  Then 
K Is  the  number  of  bridges  of  G. 

7.  For  1 • 1 K and  j - 1,2,  let  B(1,j)  - T(z(1),j).  Then 

i 

{ (B(  1 ,1 ) ,B(  1 ,2))  | 1 - 1 10 


M,  If  f ( 1 ) - 1 


10,  otherwise. 


0,  if  G^  Is  connected 

j> 

1,  otherwise. 


-vet  of  bridges  of  G. 


Notice  that  G is  bridge  connected  if  and  only  if  K = 0.  Thus 
this  procedure  for  determining  the  bridges  of  G also  decides  whether  or 
not  G is  bridge  connected.  The  entire  algorithm  is  summarized  and  analyzed 
below. 

ALGORITHM  BRIDGES  (1) 

Let  G = (V,E)  be  a connected,  undirected  graph,  where 
V * {1,  . . .,  n}.  Using  the  adjacency  matrix,  A,  of  G as  input, 
find  K,  the  number  of  bridges  of  G,  and  a K x 2 array,  B,  so 

that  the  set  {(B(i  ,1 ) ,B(i  ,2))  | i * 1 K}  is  the  set 

of  bridges  of  G,  in  the  following  way. 

1.  Compute  an  (n-1)  x 2 array  T where  the  set 

{ (T  ( i ,1)  ,T(i  ,2))  | i = 1,  . . .,  n-1} 

is  the  set  of  edges  of  a spanning  tree  of  G.  (By  Theorem  4 of  Section 

2 

3.3,  this  can  be  done  in  time  0(log  n)  with  nTn/lognl  processors.) 

2.  For  k = 1,  . . .,  n-1,  and  for  each  pair  (i,j)  e V,  let 

10,  if  i = T(k,l ) and  j = T(k ,2) 

0,  if  i = T(k,2)  and  j = T(k,l) 

A(i  ,j) , otherwise. 

(Partition  the  set  of  triples,  {(i,j,k)  ( 1 , j e V,  k = 1,  . . .,  n-1},  into 

3 

Pn  /lognl  sets  of  at  most  Tlognl  elements  each  and  assign  one  processor 
to  each  set.  Then  step  2 can  be  done  in  time  0(logn)  with  I'nVlogn'l 
processors. ) 
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I r 


3.  For  i=l,.  . . , n-1 , let 


(0,  if  is  connected 


f(i)  - 


ll , otherwise. 


(By  Corollary  1 of  Theorem  5,  this  can  be  done  in  time  0(log  n)  with 
2 

n Tn/lognl  processors.) 

4.  For  i = 1 , . . . , n-1 , let 


'i,  if  f(i ) = 1 


y(i)  = 

^0,  otherwise. 

(constant  time,  n-1  processors) 

5.  Sort  the  array  y ( 1 ) , . . . .,  y(n-l)  in  decreasing  order  to  obtain 
z(l),  . . .,  z(n-l).  (As  mentioned  in  Section  2.2,  this  can  be  done  in 
time  0(logn)  with  nTlognl  processors.) 

6.  Let  m = min  (i  | i * 1,  . . . , n-1  and  z(i)  = 0).  Let  K = m-1. 

(The  minimum  of  n numbers  can  be  found  in  time  0(1 ogn)  with  Tn/lognl 
processors. ) 

7.  For  i ■ 1 , . . . , K and  j * 1,2,  let  B(i,j)  = T(z(i),j).  (constant 
time,  n processors) 

p 

ALGORITHM  BRIDGES  (1)  has  time  complexity  0(log  n)  and  processor 
2 

complexity  n Tn/lognl.  The  best  serial  algorithm  for  finding  the  bridges 
of  G uses  an  adjacency  list  representation  of  G and  has  time  complexity 


r I 


77 


2 

0(m+n),  where  m is  the  number  of  edges  of  G [30].  If  m = 0(n  ),  this 

2 

algorithm  has  time  comolexity  0(n  ).  It  follows  from  Theorem  14  of 

Section  8.2  that  any  algorithm  which  decides,  from  the  adjacency  matrix 

of  G,  whether  or  not  G is  bridge  connected,  must  have  time  complexity 

at  least  cn2,  for  some  constant  c.  Comparing  ALGORITHM  BRIDGES  (1)  with 

2 o 

the  best  serial  algorithm  shows  a speedup  of  0(n  /log  n)  and  an  efficiency 
of  0{l/nlogn).  It  will  be  shown  that  the  efficiency  can  be  improved  by 
using  ALGORITHM  BRIDGES  (2)  of  Section  6.3. 

6.2  Preliminary  Algorithms 

In  this  section,  algorithms  are  presented  to  solve  problems 
which  arise  in  the  improved  algorithm,  of  Section  6.3,  for  finding  the 
bridges  of  a graph. 

Lennia  12.  Let  V * (1 , . . . , n}  and  let  F : V-*-V  be  a function.  For 

each  i e V,  let  F°( 1 ) = i and  for  k = 1 , . . . , n-1 , let  Fk(i)  = 

F(Fk“\i)).  Then  for  a given  j e V,  Fk(j),  for  k = 0,  . . . , n-1  can  be 

computed  in  time  O(logn)  with  n processors. 

Proof:  For  each  i e V,  let  Y (i)  = F(i)  and  for  t = 1,  . . ., 

0 t 

r 1 og (n-1  )1 , let  Yt(1)  - Yt-1  (Y^ (i )) . Then  Yt(1)  = F2  (i)  for  all 
1 e V.  Assume  that  for  t * 0,  . . .,  riog(n-l)],  Y^ ( i ) has  been  computed 
for  all  i e V.  Assume  that  for  some  r,  with  0 - r < riog (n-1 )1 , F (j) 
has  been  computed  for  0 S k S 2r  • 1.  Then  Fk(j),  for  2r  < k < 2r+1-l, 

p.  1 

can  be  computed  as  follows.  For  0 < s < 2 , 
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F2r+S(j)  « F2r(FS(j)) 

s 2r 

Since  it  was  assumed  that  F (j)  has  been  computed  and  F (i)  has  been 

2r  s 

computed  for  all  i e V,  F (F  (j))  can  be  computed  in  constant  time.  Thus 
the  following  steps  compute  F (j),  for  k - 0,  . . n-1,  from  F. 

1.  Let  t - 0 and  let  YQ(i)  * F(i)  for  all  i e V. 

2.  If  t i log(n-l),  go  to  step  4,  having  computed 

Y0(i),  ....  Yri0g(n-i  )i^)  for  a11  i £ V-  Otherwise,  increment  t by  1 . 

3.  Let  Yt(i)  * Y^i ( Yt_i ( i ) ) for  all  i e V. 

Then  return  to  step  2. 

4.  Let  r = 0. 

1/ 

5.  If  r 2 log(n-l),  stop,  having  computed  F (j)  for  k = 0,  . . .,  n-1. 
Otherwise,  increment  r by  1 . 

6.  For  s = 0,  ....  2r  - 1,  let  F2  +s(j)  = F2  (Fs(j)).  Then  return 
to  step  5. 

Each  of  steps  1 through  6 can  be  done  in  constant  time  with  at 
most  n processors.  Since  steps  2 and  3,  and  5 and  6 are  iterated  at  most 
riog(n-l)!  + 1 times,  the  entire  algorithm  can  be  done  in  time  0(logn) 
with  n processors.  QED 

Theorem  8.  Let  G * (V,E)  be  a directed  forest  with  n vertices.  Let 
F : V-+V  be  a function  such  that 

E - {(1,F(1))  | i e V,  i t F(i)}. 


From  F,  the  transitive  closure  of  G can  be  found  in  time  O(logn)  with 
2 

•n  processors. 

* 

Proof:  Let  A be  the  adjacency  matrix  of  G and  let  A be  the 

★ 

transitive  closure  of  A.  For  i,j  e V,  A (i,j)  3 1 if  and  only  if  j 3 

f it 

F (i)  for  some  t = 0,  . . n-1 . Thus  A can  be  computed  in  the  follow- 

ing way. 

1.  For  each  i e V,  compute  Ft(i)  for  t = 0,  . . . , n-1.  (By  Lemma  12, 
this  can  be  done  in  time  0(logn)  with  n processors.) 

2.  For  each  i e V,  let 

m(i)  * min  (t  | t * 0,  . . .,  n-1  and  Ffc(i)  3 Ft+1(i)}. 

Then  for  each  i e V,  F°(i),  . . .,  are  distinct  and  F*(i)  - 

Fy(1)  for  all  x,y  l m(i). 

(Since  for  each  i e V,  the  minimum  of  at  most  n numbers  must  be  found, 

this  can  be  done  in  time  0(logn)  with  n[n/logn]  processors.) 

* 2 

3.  For  each  pair  i,j  e V,  let  A (i,j)  3 0.  (constant  time,  n processors.) 

4.  For  each  i e V,  replace  A*(i,Ft(1))  by  1 for  t 3 0,  . . .,  m(i). 

2 

(constant  time,  n processors) 

~ * 

Thus,  computation  of  A from  F can  be  carried  out  in  time 
o 

0(logn)  with  n processors.  Notice  the  necessity  of  step  2.  If  for  some 
i e V there  are  x,  y such  that  x / y and  m(i)  S x,y  s n-1,  then  in  step 
4,  the  two  processors  associated  with  1,x  and  i,y  would  both  be  instructed 
to  store  a 1 in  the  same  memory  location,  A (i.F^^O)).  This  is  not 
allowed  by  the  model  of  computation.  QED 
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Corollary.  Let  G = (V,E)  be  a connected,  undirected  graph,  where 
V = {1 , . . n}.  There  is  an  algorithm  which  computes  a function 

F : V-*V,  so  that  the  graph, 

> 

T - (V,  { ( 1 ,F( i ) ) | i e V,  i f F(i)}) 

2 2 

is  a directed  spanning  tree  of  G,  in  time  0(log  n)  with  n processors, 
from  the  adjacency  matrix  of  G. 

Proof;  This  follows  from  Theorem  6 of  Section  3.4  and  Theorem 

8.  QED 

Let  G * (V,E)  be  a rooted,  directed  tree  and  let  F : V-*V  be 
a function  such  that 

E * {(1,F(1))  | 1 e V,  i f F(1 )}. 

Let  b : V-*T0,1}  be  any  function  and  let  f:  V-*-{0,l}  be  defined  for  1 c V 

by 

!1,  if  for  some  descendant  k of  i in  G,  b(k)  - 1 

; 

0,  otherwise.  , 

k ( 

In  the  improved  bridges  algorithm,  it  is  necessary  to  compute  f from  F. 

2 

Although  this  can  be  done  in  a straightforward  way  in  time  0(log  n)  with  * 

2 

n processors,  it  will  be  shown  in  Theorem  9 that  f can  be  computed  with 
only  nTlognl  processors.  This  Is  the  crucial  step  in  reducing  the  number 
of  processors  used  to  find  the  bridges  of  a graph  from  n to  n Mognl. 


i 


81 


Define  FK(i)  for  all  i e V and  for  k = 0,  . . n-1  as  before. 

For  i,j  e V,  j Is  a descendant  of  i if  and  only  if  F (j)  = i for  some 

k e {0,  . . n-1}.  Let  Y ( 1 ) = F(i)  for  all  i e V.  For  t ■ 1,  . . 

0 t 

riog(n-l)l  let  Yt(1)  = Yt_l(Yt-l(i))  = F2  (i).  Define  ft  : V-{0,1},  for 


all  i e V and  for  t = 0, 


, flog (n-1 ) 1 , by  ft(i)  s 1 if  and  only  if 


for  some  v e V and  some  k with  0 < k < 2 , F (v)  * i and  b(v)  = 1. 

Then  ^riog(n-l)!^  * f(i)  for  all  1 e V.  Assume  that  for  all  i V and 

for  0 < t s riog(n-l)!,  Yt(i)  has  been  computed.  It  will  be  proven  in 
Theorem  9 that  f can  be  calculated  from  b in  the  following  way. 

ALGORITHM  COLLECT 


1.  Let  t * 0 

2.  For  each  1 e V, 

(a)  let  f(i)  - b(1) 

(b)  let  fQ(1)  • f(1). 

3.  If  t > log(n-l),  stop,  having  computed  f ri0g( n.-| ) 
Increment  t by  1. 


f.  Otherwise, 


4.  For  all  1 e V,  let 


yt(D 


if  * 1 


0,  otherwise. 


« 


r 


i 
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5.  Sort  yt(l)»  yt(n)  in  decreasing  order  to  obtain 

z^(l),  • • • i zt(n). 

6.  Let  st(l)  = zt(l ) and  for  i = 1,  . . . , n-1,  let 


' 0 , if  zt(i)  = zt(i+l) 


st(i+l) 


Lzt(i+1),  otherwise. 


(Then  all  nonzero  entries  of  the  lxn  array  st  are  distinct  and  every 


nonzero  entry  of  the  array  yt  appears  in  the  array  st.  Note  that  if 


the  nonzero  entries  of  st  were  not  all  distinct,  then  in  step  7, 


two  processors  would  attempt  to  change  the  content  of  the  same  memory 
location.  This  is  not  allowed  by  the  model.) 


7.  For  each  i e V with  s^(i)  f 0,  replace  f(j)  by  1,  where 


j * st(1).  Then,  for  all  i e V,  let  ft(i)  * f(i).  Return  to 


step  3. 

(Notice  that  for  all  j e V,  if  ft(j)  = 1,  then  f t • ( j ) = 1 for  all 
t'  i t.) 


In  ALGORITHM  COLLECT,  f^i ) = I,  for  t > 1 if  and  only  if 

f.  ,(i)  = 1 or  i appears  in  the  array  s..  The  vertex  i appears  in  the 
t_1  1 2t-l 

array  s^  if  and  only  if  there  is  a v e V such  that  Yt  -j (v ) = F (v)  = i 

and  f £_.|  ( v)  = 1. 


Lemma  13.  For  t = 0,  . . Tlog (n-1  )1  and  i e V,  let  ft(i)  be  as  defined 
in  ALGORITHM  COLLECT.  Then  ft(i)  = 1 if  and  only  if  there  is  a vertex  v 
of  V with  b(v)  = 1 and  Fk(v)  = i,  where  0 - k < 2t. 

Proof:  If  t = 0,  fQ(i)  = 1 if  and  only  if  b(i)  = 1 and  there- 
fore the  lenma  is  true.  Assume  inductively  that  for  some  t i 0 the 
lemma  is  true.  Assume  ft+^(i)  = 1.  If  ft(i)  = 1,  then  by  the  induction 

js 

hypothesis  there  is  a vertex v with  b(v)  = 1 and  F (v)  = i,  where  0 i k < 2 

< 2t+1,  so  the  lemma  is  true.  If  Mi)  t 1,  then  i must  appear  in  the 

t t 

2 1 

array  sfc+j.  Thus,  there  is  a vertex  w such  that  ft(w)  * 1 and  F (w)  = i 

I 

By  the  induction  hypothesis,  there  is  a vertex  u and  a k such  that 

I ' t ' 

b(u)  * 1,  0 i k < 2fc,  and  Fk  (u)  * w.  But  then  i = F2  (Fk  (u))  and  since 
0 S 2*  S 21  + k < , the  lemma  is  true.  Conversely,  let  i and  v be 

vertices  of  V such  that  b(v)  * 1 and  F (v)  = i,  where  0 $ k < 2 . 


If  Mi)  = 1,  then  f.., (i)  = 1.  Otherwise,  by  the  induction  hypothesis, 
t I » 

2t  < k < 2t+1.  Let  k'  * k - 2t.  Then  0 < k’  < 2t.  Let  w = Fk  (v). 

By  the  induction  hypothesis,  f t (w)  = 1.  Since 


F2t(w)  = F2t(Fk-2t(v)) 


Yt(w)  = i,  and  thus  i appears  in  the  array  st+-j 


Then  f ^(i)  = 1.  QED 
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Thus,  ALGORITHM  COLLECT  calculates  ^f)0g(n_i )] » where  for  i e V, 
friog(n-l)l(i)  = ^ ^ and  on^  ^ ^ere  is  a v e V with  b(v)  = 1 and 
F^(v)  = i for  some  k with  0 ^ k < that  is,  if  and  only  if 

there  is  a descendant  of  i in  G with  b(v)  = 1.  Thus  f = ^ri og (n-1 )V 

Theorem  9.  Let  G = (V,E)  be  a directed,  rooted  tree,  with  V = (1,  . . .,  n}. 
Let  F : V-*V  be  a function  such  that 

■> 

E = { (1 ,F(1 ) ) | i e V,  i f F(i)}. 

Let  b : V-»-{0,l } be  any  function  and  define  f : V-*-{0 , 1 } for  i e V by 

!1,  if  for  some  descendant  j of  i,  b(j)  = 1 

; 

0,  otherwise. 

p 

Then  f can  be  computed  from  F and  b in  time  0(1 og  n)  with  nTlognl  processors. 

Proof:  It  was  shown  in  the  proof  of  Lemma  12  that  for 
t = 0,  . . .,  riog(n-l)!,  Yt(i)  = F (i)  can  be  computed  from  F for  all 
i e V in  time  0(logn)  with  n processors.  It  has  been  shown  in  Lemma  13 
that  ALGORITHM  COLLECT  computes  f,  given  b and  the  functions  Yq,  , . . ., 

^riog(n-l)V  ^ remains  t0  ^ind  time  and  Processor  complexities  of 
ALGORITHM  COLLECT.  All  steps,  except  step  5 can  be  done  in  constant 
time  with  n processors.  In  step  5,  n numbers  must  be  sorted, 
which  takes  time  0(logn)  and  nflognl  processors.  Since  steps  3 through 

7 are  iterated  at  most  F 1 og ( n-1 ) 1 + 1 times,  ALGORITHM  COLLECT  can  be 
2 

done  in  time  0(log  n)  with  nHognl  processors.  Thus  f can  be  computed 


from  b and  F in  time 


O(logn)  + 0(log2n)  = O(log^n) 


with 


max  {n,  nTlognl}  = nTlognl 

processors.  QED 

6.3  Improved  Bridge  Algorithm 

Let  G = (V,E)  be  a connected,  undirected  graph,  where 
V = {1 , . . .,  n}.  Let  T = (V,E(T))  be  a directed  spanning  tree  of  G 
and  let  F : V-*-V  be  such  that 

E(T)  = {( i,F(i))  | 7 e V,  i f F(i)>. 

Then  the  undirected  graph  S = (V,E(S)),  where 

E(S)  = {(i ,F(i ))  | i e V and  F(i)  f i}, 

is  a spanning  tree  of  G.  By  Lemma  11,  every  bridge  of  G is  in  E(S).  In 
the  algorithm  presented  in  this  section  for  finding  the  bridges  of  G, 
a test  is  made  on  each  vertex  i e V,  with  i ? F ( i ) , to  determine  whether 
or  not  the  edge  (i,F(i))  of  E(S)  is  a bridge  of  G.  The  test  made  on 
each  vertex  is  based  on  the  following  theorem. 

Theorem  10.  Let  i be  a vertex  of  V such  that  F(i)  j1  i.  The  edge  (i,F(i)) 
of  S is  not  a bridge  of  G if  and  only  if  either 

(i)  there  is  an  edge  of  G joining  a proper  descendant  of  i 
and  a nondescendant  of  i in  S,  or 


(ii)  there  is  an  edge  of  G joining  i and  a nondescendant  of 
i in  S,  other  than  F(i). 

The  proof  of  Theorem  10  relies  on  Lemma  14  below. 

Lemma  14.  The  edge  e of  G is  a bridge  of  G if  and  only  if  there  is  no 
cycle  of  G containing  e. 

Proof:  See  [9],  p.  27.  QED 

Proof  of  Theorem  10:  Assume  that  (i,F(i))  is  not  a bridge  of 
G.  Let  vQ,v1,  . . .,  v^  beacycle  in  G containing  (i,F(i)),  where 
vQ  = F(i)  = and  = i.  Then  k > 3,  so  i,F{i)  and  are  distinct 

vertices.  If  v2  is  a nondescendant  of  i in  S,  then  the  edge  (i.v^)  of  G 
is  an  edge  as  described  in  (ii)  of  the  theorem.  If  v2  is  a descendant 
of  i,  let  t be  the  largest  index  for  which  vt  is  a proper  descendant  of 
i in  S.  Then  2 S t s k-1  and  vt+-j  is  a nondescendant  of  i in  T and  thus 
(vt,vt+1)  is  an  edge  of  G as  described  in  (i)  of  the  theorem.  The 
converse  is  proven  as  follows. 

(i)  Assume  there  are  vertices  j and  k of  V such  that  j is  a 
proper  descendant  of  i , k is  a nondescendant  of  i and  (j,k)  is  an  edge 
of  G.  Let  p be  the  simple  path  in  S joining  j and  i.  Let  x be  the 
youngest  coirmon  ancestor  of  i and  k in  S.  Let  q be  the  simple  path  in  S 
joining  i and  x,  and  r,  the  simple  path  in  S joining  k and  x.  Since  i,j, 
and  k are  all  distinct,  p u q u r u { ( j , k ) > is  a cycle  in  G.  Since  k is  a 
nondescendant  of  i , x f i , so  that  F(i)  is  a descendant  of  x.  Then 
(i,F(i))  is  on  path  q and  is  in  the  cycle  p u q u r u { ( j ,k ) } and  thus 
is  not  a bridge  of  G by  Lemma  14. 
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(ii)  Assume  there  is  a vertex  k of  V such  that  k is  a nonde- 
scendant of  i , (i,k)  is  an  edge  of  G,  and  k f F(i).  Let  x e V be  the 
youngest  common  ancestor  of  F(i)  and  k.  Let  p be  the  simple  path  in  S 
joining  F(i)  and  x,  and  q,  the  simple  path  joining  k and  x.  Either  p or  q 
may  contain  no  edges,  in  case  F(i)  = x or  k = x,  but  since  k f F(i),  at 
least  one  of  p and  q must  contain  an  edge.  Then  p v q o { (i  ,F(i) ), (i  ,k)> 
is  a cycle  of  G containing  (i,F(i))  so  that  (i,F(i))  is  not  a bridge  of 
G,  by  Lemma  14.  QED 

ALGORITHM  BRIDGES  (2)  will  work  in  the  following  way.  First, 
a function  F : V+V  is  computed  so  that  the  graph 

T = (V,  {(i  ,F(i))  | i e V,  F(i)  f 1}) 

is  a directed  spanning  tree  of  G.  The  adjacency  matrix,  M^,  of  T is 
★ 

constructed  and  MT>  the  transitive  closure  of  M^.,  is  computed.  Then 
for  i,j  e V,  j is  a descendant  of  i in  S,  where 

S = ( V , { ( i , F ( i ) ) | i e V,  1 t F(i)}) 

★ 

if  and  only  if  M^.  (i,j)  = 1.  Next,  for  each  i in  V,  a 1 x n array,  D.. , 
is  computed,  where  for  j e V, 

!1,  if  j is  a nondescendant  of  i in  S and  there  is  an  edge 
of  G joining  j and  a proper  descendant  of  i in  S 
1,  if  j is  a nondescendant  of  i in  S and  j t F(i) 
and  £i,j)  is  an  edge  of  G 

0,  otherwise. 
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Then  by  Theorem  10,  for  each  i in  V,  such  that  i + F( i ) , the  edge 
(i,F(i))  of  G is  a bridge  of  G if  and  only  if  the  1 x n array  D„.  does  not 
contain  a 1 . 


Calculation  of  D 


i 


For  each  j e V,  let  b.  : V-K0,1}  be  defined  for  each  i e V by 

J 

11,  if  (i,j)  is  an  edge  of  G,  j t F(i),  and  j is  a 
nondescendant  of  i in  S 

1 

0,  otherwise. 

For  each  j e V,  let  f^  : V-*-V  be  defined  for  each  i e V by 


f(j)(i)  = 


[1 , if  some  descendant  k of  i has  b.(k)  = 1 

J 


10,  otherwise. 


:(  j ) 


Lemma  15.  Let  the  functions  b.  and  f'  , for  j e V,  be  defined  as  above. 

J 

For  each  pair  i , j e V,  let 

!1,  if  f^(i)  * 1 and  j is  not  a descendant  of  i in  S 
0,  otherwise. 


Then  D7(j)  = 1 if  and  only  if  either 

(i)  j is  a nondescendant  of  i and  there  is  an  edge  of  G joining 
j and  a proper  descendant  of  i or. 
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(ii)  j is  a nondescendant  of  i , j j*  F(i),  and  (i,j)  Is  an 

edge  of  G. 

Proof:  Assume  D^j)  = 1.  Then  j is  a nondescendant  of  i and 
f^)(i)  = 1.  Thus,  by  definition  of  f^,  there  is  a descendant  k of  i 
for  which  bj(k)  « 1.  If  k = i,  then  by  de  ini  t i on  of  b^ , (ii)  holds. 

If  k f i,  then  k is  a proper  descendant  of  i and  thus  (i)  holds. 

Conversely,  assume  j Is  a nondescendant  of  i in  S. 

(i)  Assume  there  is  a proper  descendant  k of  i and  an  edge 

(k,j)  of  G.  Then  j 1 F(k)  and  j is  a nondescendant  of  k.  Thus,  by 

definition  of  b.,  b.(k)  = 1 so  that  f^(i)  = 1 and  D.(j)  * 1. 

J J 1 

(ii)  If  j ^ F(i)  and  (i,j)  is  an  edge  of  G then  by  definition 

of  b.,  b.(i)  = 1.  Thus  f^(i)  = 1 and  so  D.  (j)  = 1.  QED 

J J 1 

By  Lemma  15,  the  following  steps  compute  D^(j)  for  all  i , j e V, 
given  A,  the  adjacency  matrix  of  G. 

1.  Compute  a function  F : V-*V  so  that  the  graph 

T = (V,  {(1  ,F(i))  | i e V,  i * F(i )}) 

is  a directed  spanning  tree  of  G.  Let  My  be  the  adjacency  matrix  of  G. 

(By  Theorem  6 and  the  corollary  to  Theorem  8,  this  can  be  done  in  time 
2 2 

0(log  n)  with  n processors.) 

2.  Compute  My,  the  transitive  closure  of  My.  (By  Theorem  8,  this  can 
be  done  in  time  O(logn)  with  n processors.) 

3.  For  each  pair  i , j e V,  let 
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1.  if  A(i,j)  = 1,  j / F(i)  and  MT(j,i)  = 0 


bjO)  - 


0,  otherwise. 


o 

(This  can  be  done  in  constant  time  with  n processors.) 

4.  For  each  pair  i,j  e V,  compute  : V-^{ 0,1}  where  f^(i)  * 1 if 

and  only  if  some  descendant  k of  j has  b,(k)  =1.  (By  Theorem  9,  for  a 

. , ( -j ) o 

given  j e V,  f'J'  can  be  computed  from  b.  and  F in  time  0(log  n)  with 

J 

npogn]  processors.  Thus  step  4 can  be  done  in  time  O(log^n)  with 
2 

n Tlognl  processors. 

5.  For  each  pair  i,j  e V,  let 


DfU)  = 


1,  if  fj ( i ) * 1 and  MT(j,i)  = 0 

> 

) 

10,  otherwise. 


2 

(constant  time,  n processors) 

Let  K be  the  number  of  bridges  of  6.  A K x 2 array,  B,  where 

the  set 


{(B(i ,1 ) ,B(i ,2) ) | i = 1 K) 


is  the  set  of  bridges  of  G,  can  be  computed  from  the  arrays  0^,  . . . , Dp 
in  the  following  way. 

6.  For  each  i e V,  let 


M , if  F(i)  / i and  the  1 x n array  0^  does  not  contain 
m(i)  » \ a 1 


0,  otherwise. 


(It  was  shown  in  Section  2.2  that  it  is  possible  to  determine  whether  or 
not  an  n-dimensional  binary  vector  contains  a 1 in  time  O(logn)  with  n 
processors.  Thus,  step  6 can  be  done  in  time  O(logn)  with  n processors.) 

7.  For  each  i e V,  let 

1i , if  m(i ) * 1 

; 

0,  otherwise. 

(constant  time,  n processors) 

8.  Sort  the  array  y(l),  ....  y(n)  in  decreasing  order  to  obtain  the 
array  z(l),  . . .,  z(n).  (0(logn)  time,  nTlognl  processors,  as  in 
Section  2.2) 

9.  Let  K = min  (j  e V | z(j)  = 0).  Let  K = K - 1.  (0(logn)  time, 
Tn/lognl  processors,  as  in  Section  2.2) 

10.  For  i * 1,  . . .,  K,  let  B( i ,1 ) = z(i)  and  B( i ,2)  = F(z(i)). 

(constant  time,  n processors) 

After  step  10,  the  set 

{ (B( i ,1 ) ,B(i  ,2))  | i = 1 K} 

is  the  set  of  bridges  of  G.  Note  that  G is  bridge  connected  if  and 
only  if  K = 0.  The  algorithm  is  summarized  and  analyzed  below. 

ALGORITHM  BRIDGES  (2) 

Let  G = (V,E)  be  a connected,  undirected  graph,  where 
V » {1,  . . ., 


n).  From  A,  the  adjacency  matrix  of  G,  find 
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K,  the  number  of  bridges  of  G,  and  a K x 2 array,  B,  where 

{(B(1 ,1 ) ,B(i,2) ) (1  = 1 K} 

is  the  set  of  bridges  of  G,  by  performing  the  steps  1 through 
10  above. 

Then  ALGORITHM  BRIDGES  (2)  has  time  complexity  O(log^n)  and 
2 

processor  complexity  n Tlognl.  As  discussed  in  Section  6.1,  the  best 

2 

serial  algorithm  for  this  problem  has  time  complexity  0(n  ).  The  speedup 

in  computation  time  which  results  from  using  the  parallel  algorithm  instead 

2 2 3 

of  the  serial  algorithm  is  0(n  /log  n)  and  the  efficiency  is  0(1/1 og  n). 

2 

Thus,  a relatively  small  amount  of  waste  is  incurred  by  using  n Tlognl 

2 2 

processors  to  achieve  a speedup  of  0(n  /log  n)  in  computing  the  bridges 
of  a graph. 

6.4  Bridge  Connected  Components 

Let  G = (V,E)  be  a connected,  undirected  graph.  The  bridge 
connected  components  of  G can  be  computed  using  the  following  theorem. 

Theorem  11 . Let  B be  the  set  of  bridges  of  G.  A subgraph  H of  G is  a 
bridge  connected  component  of  G if  and  only  if  it  is  a connected  component 
of  the  graph  G ■ (V,E-B). 

I 

Proof:  Assume  H is  a connected  component  of  G . If  (i,j) 

I 

Is  a bridge  of  H,  when  (i,j)  is  removed  from  H there  is  no  path  in  G 
joining  i and  j.  But  since  (i,j)  is  not  a bridge  of  G,  there  must  be 
a simple  path  v-|,  . . v^  in  G,  where  i a v^,  j = v^  and  for  some  t. 
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with  1 < t < k-1,  (vt*vt+i)  *s  a bridge  of  G.  This  is  a contradiction 
since  v^,  ....  v^,  v^,  . . . , is  a path  in  G joining  v^.  and  vt+^ 
which  does  not  contain  (vt»vt+j).  Thus  H is  bridge  connected.  Let  K ? H 
be  a connected  subgraph  of  G with  H £ K.  There  is  a vertex  v of  K which 

I 

is  not  in  H.  If  u is  a vertex  of  H,  there  is  no  path  in  G joining  u and 
v.  Thus  any  path  in  K joining  u and  v must  contain  a bridge.  Then  K 
is  not  bridge  connected  and  thus  H is  a bridge  connected  component  of  G. 
Assume  H is  a bridge  connected  component  of  G.  Then  H is  a 

I I 

connected  subgraph  of  G . Let  K be  the  connected  component  of  G con- 
taining H.  Since  K Is  bridge  connected  and  HSKiG,  H = K and  thus  H 

1 

is  a connected  component  of  G . QED 

By  Theorem  11,  the  bridge  connected  components  of  G can  be 
computed  by  finding  the  bridges  of  G,  removing  all  bridges  from  G to 

I I 

obtain  a graph  G , and  then  finding  the  connected  components  of  G . The 

algorithm  below  uses  this  technique  to  compute  the  bridge  connected 

2 2 

components  of  G in  time  0(log  n)  with  n Mognl  processors. 

ALGORITHM  BRIDGE  CONNECTED  COMPONENTS 

Let  G = (V,E)  be  a connected,  undirected  graph,  where 
V * {1,  ...»  n}.  From  the  adjacency  matrix.  A,  of  G,  compute 
a 1 x n array  0 where  D(i)  = D(j)  if  and  only  if  i and  j are 
in  the  same  bridge  connected  component  of  G. 

1.  Compute  a K x 2 array  B,  where  K is  the  number  of  bridges  of  G 
and  the  set 
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{ ( B ( i ,1 ) ,B(i  ,2))  | 1 • 1,  ....  K} 

2 

is  the  set  of  bridges  of  G.  (This  can  be  done  in  time  0(log  n)  with 
n^Tlognl  processors  by  using  ALGORITHM  BRIDGES  (2).) 

2.  For  i = 1 , ....  K,  replace  A ( B ( i ,1 ),B(i ,2))  by  0.  A is  now  the 

I 

adjacency  matrix  of  G . (constant  time,  K < n processors) 

3.  Compute  a 1 x n array,  D,  where  D(i)  = D(j)  if  and  only  if  i and  j 

I 

are  in  the  same  connected  component  of  G . (By  Theorem  5,  this  can  be 
2 

done  in  time  0(log  n)  with  nTn/lognl  processors.) 
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Chapter  7 


CYCLES 

In  this  chapter,  problems  involving  cycles  in  a graph  are 

investigated.  In  Section  7.1,  it  is  shown  that  if  G is  a connected, 

undirected  graph  with  n vertices,  a cycle  of  G can  be  found  in  time 
? 2 

0(log  n)  with  n processors,  if  one  exists,  and  a cycle  basis  for  G can 

2 3 

be  found  in  time  0(log  n)  with  n processors.  In  Section  7.2,  an  algorithm 
is  presented  which  finds  a cycle  of  minimum  length,  if  one  exists,  in 
a directed  graph  with  n vertices  in  time  0(log  n)  with  n processors. 

For  the  problem  of  finding  a cycle  of  a directed  graph  G,  no  parallel 
algorithm  has  been  found  which  is  more  efficient  than  finding  the 
shortest  cycle  of  G. 

7.1  Cycles  in  an  Undirected  Graph 

Finding  a Cycle  in  an  Undirected  Graph 

Let  G = (V,E)  be  a connected,  undirected  graph,  where 
V - {1,  . . .,  n},  and  let  S be  a spanning  tree  of  G.  If  there  is  an 
edge  e of  3 which  is  not  an  edge  of  S,  adding  e to  S creates  a cycle  in 
S.  The  algorithm  presented  here  for  finding  a cycle  of  G will  first 
find  a spanning  tree,  S,  of  G and  vertices  x and  y of  V such  that 
(x,y)  e E,  but  (x,y)  is  not  an  edge  of  S.  Next,  the  cycle  C ^ y 
created  by  adding  (x,y)  to  S,  is  computed.  If  yca(x,y)  is  the  youngest 
common  ancestor  of  x and  y in  S,  then  C^x  consists  of  the  edge  (x,y) 
and  the  simple  paths  in  S joining  yca(x,y)  with  x and  y. 
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Let  F : V-*-V  be  a function  such  that  the  directed  graph 

T = (V,  {(i  ,F(i))  | i e V,  F(i ) f i}) 

is  a directed  spanning  tree  of  G.  Then  the  undirected  graph 

S = (V,  {(1 ,F(i ))  | i e V,  i f F(i )}) 

is  a spanning  tree  of  G.  If  x and  y are  vertices  of  V,  the  youngest 
common  ancestor  of  x and  y in  S can  be  found  in  the  following  way. 

(a)  Construct  the  adjacency  matrix  of  T. 

* 

(b)  Find  My,  the  transitive  closure  of  My.  Then  for  i,j  e V,  j is  a 

★ 

descendant  of  i in  S if  and  only  if  Mj(j,i)  = 1. 

★ 

(c)  Define  a relation  R on  V by  jRi  if  and  only  if  M^(j,i)  = 1.  Then 

yca(x,y)  = minR  (i  e V | xRi  and  yRi}. 

I 

Lenina  16.  Let  S = (V,E  ) be  a tree  and  F : V-*-V  a function  such  that 
e'  = {(1»F(i))  | i e V,  i t F(i)}. 

Then  for  a given  pair  x,y  of  V,  yca(x,y)  can  be  computed  in  time  O(logn) 

2 

with  n processors. 

Proof:  Steps  (a)  through  (c)  above  compute  yca(x,y)  from  F. 

o 

Step  (a)  can  be  done  in  constant  time  with  n processors,  step  (b),  in 

time  O(logn)  with  n processors,  by  Theorem  8,  and  step  (c),  in  time 

O(logn)  with  Tn/lognl  processors,  as  in  Section  2.2.  Thus  yca(x.y)  can 

o 

be  computed  in  time  O(logn)  with  n processors.  QED 

Corollary.  Let  S and  F be  as  in  Lemma  16.  Then  yca(i,j)  can  be  computed 

2 

for  all  i,j  e V in  time  O(logn)  with  n fn/lognl  processors. 
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Proof:  In  step  (c),  preceding  Lemma  16,  for  alj_  i ,j  e V let 

yca(i.j)  = minR  {v  e V | iRv  and  jRv). 

Then  steps  (a)  through  (c)  compute  yca(i.j)  for  all  i ,j  e V.  Since  the 

2 

new  step  (c)  can  be  done  in  time  O(logn)  with  n Tn/lognl  processors,  the 
corollary  follows.  QED 

To  find  vertices  x and  y of  V such  that  (x,y)  e E,  but  (x,y) 

I 

is  not  an  edge  of  S * (V,E  ),  define  g : V-*V,  for  i e V,  by 
/o,  if  no  edge  in  E - E is  incident  with  i 

9(1)  -i 

(min  {j  e V | (i,j)  e E - E },  otherwise. 


If  g(i)  = 0 for  all  i e V,  then  G contains  no  cycles.  Otherwise,  let 
h = min  {i  e V | g(i)  t 0}.  Then  (h,g(h))  is  an  edge  of  G which  is  not 
in  S.  Let  w * yca(h,g(h)).  Let  r and  s be  the  smallest  integers  such 
that  Fr(h)  * w and  Fs(g(h) ) * w.  Then  the  cycle 

h.F(h),  . . Fr(h),  Fs-1(g(h)),  . . .,  F(g(h) ) , g(h),  h 


is  the  cycle  of  G created  by  adding  (h,g(h))  to  S. 


Define  C 


Ch.g(h) ) 


V+V,  for  i e V,  by 
'F(i),  if  i = Fk(h)  and  0 i k < r-1 


'(h,g(h))(i)  = ^"^(h)),  if  i « Fk(g(h))  and  1 < k < s 
0,  otherwise. 


I 


Then  the  sequence 
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C(h.9(h))(t1>*  C(M(h))(h)’ 


C(M(h))<h> 


consists  of  the  vertices  of  the  cycle  created  by  adding  (h,g(h))  to  S, 
in  the  order  in  which  they  occur,  followed  by  a sequence  of  zeroes.  The 
entire  algorithm  is  presented  and  analyzed  below. 


ALGORITHM  CYCLE 


Let  G * (V,E)  be  a connected,  undirected  graph,  where 
V = (1,  . . .,  n}.  From  the  adjacency  matrix.  A,  of  G, 
find  a cycle  of  G in  the  following  way. 

1.  Find  a function  F : V+V  so  that  the  graph 

> 

T * (V,  {(i  ,F(i ) | i e V,  i f F(1 )}) 

is  a directed  spanning  tree  of  G and  let  M^  be  the  adjacency  matrix  of 

T.  (By  Theorem  6 and  the  corollary  to  Theorem  8,  this  can  be  done  in 
2 2 

time  0(log  n)  with  n processors.) 

♦ 

2.  Compute  M^.  (By  Theorem  8,  this  can  be  done  in  time  O(logn)  with 
2 

n processors. ) 

3.  For  each  i e V with  F(i)  i,  replace  A(1,F(i))  and  A(F(i),i)  by  0. 
(constant  time,  n processors) 

4.  For  each  i e V,  let 

!0,  if  the  vector  [A ( i , 1 ) , . . .,  A(i,n)]  does  not 
| contain  a 1 

min  (j  e V | A ( i , j ) = 1),  otherwise. 


(By  the  results  of  Section  2.2,  step  4 can  be  done  in  time  O(logn)  with 
2 

n processors.) 

5.  If  the  vector  [g(l),  ....  g(n)]  has  no  nonzero  entry,  stop,  and 
indicate  that  G has  no  cycles.  Otherwise,  let  h = min  {i  e V | g(i)  + 0}. 
(It  was  shown  in  Section  2.2  that  this  can  be  done  in  time  0(logn)  with 

n processors.) 

6.  Find  yca(h,g(h)).  (By  Lemma  16,  this  can  be  done  in  time  O(logn) 
with  n processors.) 

7.  For  all  i e V,  compute  g(h))  as  ^°^ows-  First,  let  C^h  g^^(i) 

0 for  all  i e V.  Then, 

(a)  if  i ? yca(h,g(h))  and  MT(h,i)  * 1 and  MT(i,yca(h,g(h)))  - 1 
replace  C(h>g(h))(i)  by  F(i). 

(b)  if  i / yca(h,g(h))  and  Mr(g(h) ,i ) = 1 and  Mj( i ,yca(h,g(h) ) ) 
1,  replace  C(M(h))  (F(i ))  by  i. 

(Step  7 can  be  done  in  constant  time  with  n processors. 

8.  Compute  g(h))^h^  for  k = 0,  . . . , n-1 . (By  Lemma  12,  this  can 
be  done  in  time  0(logn)  with  n processors.) 

Thus,  ALGORITHM  CYCLE  can  be  performed  in  time  0(log2n)  with 
n2  processors.  The  best  serial  algorithm  for  finding  a cycle  in  an 
undirected  graph  uses  an  adjacency  list  representation  of  the  graph  and 

has  a v/orst  case  time  complexity  of  0(n2)  [21].  Thus,  ALGORITHM  CYCLE 

2 2 . 
achieves  a speedup  of  0(n  /log  n)  over  the  best  serial  algorithm  with 

2 

an  efficiency  of  0(1/1 og  n). 
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Finding  a Cycle  Basis 

Let  G = (V,E)  be  an  undirected  graph.  Let  H = (VH,EH)  and 
K = (Vj^,Ek)  be  subgraphs  of  G.  The  symmetric  difference  of  H and  K, 

I I I 

written  H © K,  is  the  subgraph  G = (V  ,E  ) of  G where 

I 

E = {e  | e e Eh  U Ek  and  e t EH  A EK) 
and 

I l 

V = {v  e V | v is  incident  with  some  edge  of  E }. 

A set  of  fundamental  cycles  of  G is  a collection,  C,  of  cycles  of  G 
with  the  property  that  any  cycle  C of  G can  be  written  as  C = C-|  Q . . . 
©Ck  for  some  cycles  C-j,  . . .,  C^  e C [21].  A cycle  basis  for  G is 
a minimal  collection  of  fundamental  cycles  of  G.  It  can  be  shown  that 
if  G has  t connected  components,  any  cycle  basis  for  G has  m - n + t 
elements,  where  m is  the  number  of  edges  of  G and  n,  the  number  of  vertices 
([4],  p.  25).  Let  S be  a spanning  forest  for  G.  Let  (e-| , . . .,  em_n+t^ 
be  the  edges  of  G which  are  not  in  S.  Let  be  the  cycle  of  G created 
by  adding  edge  e^  to  S.  Then  C = {C-j,  . . .,  cm_n+t)  is  a set  of 
fundamental  cycles  for  G (see  [21])  of  cardinality  m-n+t  and  thus  C 
is  a cycle  basis  for  G. 

Let  G = (V,E)  be  a connected,  undirected  graph  with 
V = {1,  . . .,  «}.  ALGORITHM  CYCLE  BASIS  below  finds  a cycle  basis  for 

I 

G in  the  following  way.  First,  a spanning  tree  S = (V,E  ) of  G is  found. 

I 

Then,  for  each  x,y  e V such  that  (x,y)  e E and  (x,y)  t E , the  cycle 
C(x  y)’  °‘,ta^ned  adding  edge  (x,y)  to  S is  computed  as  in  ALGORITHM 
CYCLE. 
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ALGORITHM  CYCLE  BASIS 

Let  G = (V,E)  be  a connected,  undirected  graph,  where 

V = {1,  . . .,  n).  Find  a cycle  basis  of  G in  the  following 

way. 

1.  Find  a function  F : V-*-V  such  that  the  graph 

T » (V,  {(i  ,F(i ))  | i e V,  i f F(i )}) 

is  a directed  spanning  tree  of  G.  (By  the  corollary  to  Theorem  8,  this 

2 2 

can  be  done  in  time  0(log  n)  with  n processors.) 

2.  Find  yea ( i , j ) for  all  i ,j  e V.  (By  the  corollary  to  Lemma  16,  this 

p 

can  be  done  in  time  O(logn)  with  n Tn/lognl  processors.) 

3.  For  each  i e V with  F(i)  f i,  replace  A(i,F(i))  and  A(F(i),i)  by  0. 

I 

Then  A(x,y)  = 1 if  and  only  if  (x,y)  e E - E . (constant  time,  n pro- 
cessors) 

4.  For  each  x,y  e V with  A(x,y)  = 1,  find  C^x  ^(x)  for  k * 0,  . . . , n-1 
using  steps  7 and  8 of  ALGORITHM  CYCLE.  (Since  steps  7 and  8 of  ALGORITHM 
CYCLE  can  be  done  in  time  O(logn)  with  n processors,  step  4 can  be  done 

3 

in  time  O(logn)  with  n processors.) 

If  it  is  desired,  a procedure  like  the  one  which  stored  K bicon- 
nected  components  in  a K x n array  can  be  used  to  store  the  cycles  in  the 
set 

<C(x>y)  I <x.y>  « E - t'} 


in  an  (m-n+1)  x n array. 
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ALGORITHM  CYCLE  BASIS  has  time  complexity  0(log2n)  and  processor 
3 

complexity  n . The  best  serial  algorithm  for  this  problem  uses  an 

adjacency  list  representation  of  the  graph  and  has  time  complexity 

O(n^)  [21].  The  parallel  algorithm  for  finding  a cycle  basis  achieves 

a speedup  of  0(n  /log  n)  over  the  serial  algorithm,  with  an  efficiency 
o 

of  0( 1 / 1 og  n).  Thus  a very  large  speedup  is  achieved  by  the  parallel 
algorithm  with  relatively  little  waste. 

7 . 2 Shortest  Cycles 

Let  G = (V,E)  be  a simple,  directed  graph  with  V = {1,  . . .,  n}. 

The  algorithm  of  this  section  finds  a cycle  of  minimum  length  in  G 
2 2 

in  time  0(log  n)  with  n Tn/logn]  processors.  It  is  based  on  knowledge 
of  two  matrices  associated  with  G.  The  shortest  path  matrix  of  G is  an 
n x n matrix  P where  for  i ,j  e V, 

!®,  if  there  is  no  path  in  G from  i to  j 
d(i,j),  the  length  of  the  shortest  path  in  G from 
i to  j,  otherwise. 

A next  vertex  matrix  of  G is  an  n x n matrix  N,  where  for  i , j e V, 

f0,  if  there  is  no  path  in  G from  i to  j 
i»  if  i = j 

k,  where  the  edge  (i,k)  is  on  a shortest  path  in  G 


from  i to  j,  otherwise. 


I 
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Notice  that  if  N(i,j)  f 0,  then  the  path 


i,  N(i,j),  N(N(i  ,j ) , j ) , . . . , j 


is  a shortest  path  in  G from  i to  j.  It  will  be  shown  in  Lemma  17  that 

P and  N can  be  calculated  from  the  adjacency  matrix.  A,  of  G in  time 
2 2 

0(1 og  n)  with  n fn/lognl  processors  by  the  following  steps. 


Calculation  of  P and  N from  A 


1.  For  all  i e V,  let 


1 > if  A(i  ,j ) = 1 


PQ(i,j)  =\0,  if  i = j 

l®,  otherwise 


i,  if  i = j 


NQ(i,j)  =(j,  if  A ( i ,j ) = 1 
\0,  otherwise 


i » If  i - j 


mQ(i  ,j)  =(j,  if  A(i  ,j)  = 1 
lO,  otherwise 


(constant  time,  n processors) 

2.  Let  t = 0.  (constant  time,  1 processor) 

3.  If  t > log(n-l),  stop,  and  let  P = Priog(n.1)1  and  N = Nfl  (n.1)r 

o 

Otherwise,  increment  t by  1.  (constant  time,  n processors) 


4.  For  all  i ,j  e V,  let 


Pt(i»j)  = min  {Pt_-|(i,k)  + Pt_-|(k,j)  | k e V}. 

(Since  the  minimum  of  n numbers  can  be  found  in  time  O(logn)  with  rn/lognl 

2 

processors,  step  4 can  be  done  in  time  O(logn)  with  nTn/lognl  processors.) 

5.  For  each  i,j  e V,  let 


ii , if  Pt(1  ,j)  ■ 0 
0,  if  Pt(i  ,j)  = 00 
min  {k  e V \ k f i and  P^( i ,j ) 
otherwise. 


pt_l ( i »k)  + pt_i (k, j ) >, 


Note  that  if  0 < Pt(i,j)  < <*>,  then  there  is  a k e V,  k f i,  such  that 
Pt(i,j)  = Pt_-|(i.k)  + Pfc_i  (k,j) . As  in  step  4,  this  can  be  done  in  time 
0(logn)  with  n rn/lognl  processors.) 

6.  For  each  i ,j  e V,  let 


0,  if  mt(i  ,j)  = 0 


Nt(i,j)  =< 

»mt(i  ,j)),  otherwise. 

2 

Then  return  to  step  3.  (constant  time,  n processors) 

Lemma  17.  Let  G = (V,E)  be  a simple,  directed  graph,  with  V = {1 , . . .,  n), 

and  let  P be  the  shortest  path  matrix  of  G,  and  N the  next  vertex  matrix 

of  G.  Then  P and  N can  be  computed  from  A,  the  adjacency  matrix  of 
2 2 

G,  in  time  0 ( 1 og n)  with  n Pn/lognl  processors. 


i 


Proof:  It  will  be  shown  first  that  steps  1 through  6 above 


compute  P and  N. 


Claim:  For  t = 0,  . . riog(n-l)!  and  i , j e V, 


(a)  Pt(i»j)  a 


P(i,j),  if  there  is  a path  in  G from  i to  j 
of  length  at  most  2t 


otherwise 


(b)  Nt(i,j) 


i . if  pt(i»J')  = 0 
0,  if  pt(i,j)  - » 


=(k,  where  the  edge  (i,k)  of  G is  on  a shortest 
( path  in  G from  i to  j,  otherwise. 


The  claim  will  be  proven  by  induction  on  t.  If  t = 0,  the  claim  is 
true.  Assume  that  for  some  t > 0,  the  claim  is  true. 

(a)  If  there  is  no  path  from  i to  j in  G of  length  at  most  2t+^ , then  by 

the  induction  hypothesis,  for  all  k e V,  either  Pfc ( i , k)  = « or  Pt(k,j)  = 
so  that  Pt+^(i,j)  = Otherwise,  let  q be  a shortest  path  from  i to  j 

in  G of  length  at  most  2t+\  Then  there  is  a vertex  k of  q such  that 

P(i  ,k)  < 2t , P(k,j)  < 2*  and  P(i,j)  = P(i,k)  + P(k,j).  Thus,  by  the 
induction  hypothesis, 


P(i  »j)  = Pt(i,k)  + Pt(k,j)  = Pt+1(1,j). 

(b)  If  Pt+1(i,j)  = 0,  then  mt+1(i)  * i and  Nt+1(i,j)  = N t ( i , i ) = i.  If 

Pt+1  ( 1 » j ) = °°»  then  mt+-|(i)  = °*  thus  Nt+l^,J)  = 0<  If 
0 < pt+]  ( i > J ) < “»  then 
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Nt+1(i,j)  = Nt(i ,mt+1(i ,j)). 

By  the  induction  hypothesis,  N^(i ,mt+^ (i  ,j ) ) is  a vertex  k t i,  such 
that  the  edge  (i,k)  is  on  a shortest  path  in  G from  i to  ( i * j ) * Since 
is  a vertex  distinct  from  i on  a shortest  path  in  G from  i to 
j.  Nt+^(i,j)  = k,  and  (i,k)  is  an  edge  on  a shortest  path  in  G from  i 
to  j.  This  completes  the  proof  of  the  claim. 

Since  the  length  of  the  shortest  path  in  G from  i to  j , for  any 
i , j e V,  is  at  most  n-1,  it  follows  from  the  claim  that  pfj0g (n_i  )1  = p 
and  Nriog(n-l)l  = N’  Thus»  the  steps  1 through  6 preceding  the  lemma 
compute  P and  N.  It  remains  to  find  the  time  and  processor  complexities 

2 

of  steps  1 through  6.  Steps  1 and  2 can  be  done  in  constant  time  with  n 
processors.  Steps  3 through  6 are  iterated  at  most  riog(n-l)!  + 1 times 

p 

and  one  iteration  can  be  done  in  time  0(logn)  with  n Tn/lognl  processors. 

2 2 

Thus,  P and  N can  be  calculated  in  time  0(1 og  n)  with  n Tn/lognl  processors. 
QED 

The  computation  of  a shortest  cycle  of  G from  the  matrices  P 
and  N is  based  on  the  following  lenma. 

Lemma  18.  If  C = (VC,EC)  is  a shortest  cycle  of  a directed  graph  G, 
where  Vc  * {v^ , . . . , v^},  and 

Ec  ~ { ( v.j  »v.- ) | 1 ~ 1»  • • •*  k-1 } {(vk.Vi)}, 

then  the  path  q from  v-|  to  vk  in  C is  a shortest  path  from  v^  to  vk  in  G. 

Proof:  Let  p be  a simple  path  in  G from  v^  to  vk  of  length 
less  than  the  length  of  q.  Then  p U { (vk» vi )>  a cycle  of  G,  shorter 
than  C.  This  contradicts  the  definition  of  C.  QED 
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By  Lemma  18,  a shortest  cycle  in  G may  be  computed  by  finding 
vertices  x,y  e V such  that 
— > 

(i)  (y.x)  is  an  edge  of  G and 

— > 

(ii)  P(x,y)  = min  (P(i,j)  | i ,j  e V and  (j,i)  is  an  edge  of  G}. 

Then  a shortest  cycle  in  G consists  of  the  union  of  the  edge  (y.x)  and  a 
shortest  path  in  G from  x to  y (which  can  be  calculated  from  N).  These 
steps  are  made  precise  in  ALGORITHM  SHORT  CYCLE  (D)  below. 

ALGORITHM  SHORT  CYCLE  (D) 

Let  G = (V,E)  be  a simple,  directed  graph  with  V = {1,  . . .,  n}. 
Find  a cycle  of  G of  minimum  length  in  the  following  way  from 
the  adjacency  matrix.  A,  of  G. 

1.  Compute  P,  the  shortest  path  matrix  and  N,  the  next  vertex  matrix 

2 2 

of  G.  (By  Lemma  17,  this  can  be  done  in  time  0(log  n)  with  n Tn/lognl 
processors. ) 

2.  Let 


m = min  (P(i,j)  | i ,j  e V,  A(j,i)  = 1}. 

If  m * stop,  and  indicate  that  G has  no  cycles.  (The  minimum  of  at 

2 2 
most  n numbers  can  be  found  in  time  O(logn)  with  Tn  /lognl  processors.) 

3.  For  all  i e V,  let  v^(j)  be  defined  for  each  j e V by 
A,  if  A(i,j)  = 1 and  P(i,j)  - m 

tyj)  = < 

lO,  otherwise. 

2 

(constant  time,  n processors) 
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4.  For  all  i e V,  let 

[1,  if  the  n dimensional  vector  v.-  contains  a 1 

0,  otherwise 

(This  involves  determining,  for  each  i e V,  whether  a binary  vector 

2 

contains  a 1.  Thus,  step  3 can  be  done  in  time  O(logn)  with  n processors.) 

5.  Let  x = min  {i  e V | f(i)  + 0).  (Note  that  if  m + «,  such  x and  y 
exist.)  Let 

y = min  (j  e V | A(j,x)  = 1 and  P(x,j)  = m}. 

(0(logn)  time,  [n/logn]  processors,  as  in  Section  2.2) 

6.  For  all  i e V,  let 

(N(i,y),  if  i t y 

0,  otherwise. 

(constant  time,  n processors) 

7.  Compute  C^(x)  for  t = 0,  . . .,  m.  Then  the  cycle 
x,  C1 (x) , . . .,  Cm(x)  = y,  x 

is  a cycle  of  G of  minimum  length.  (By  Lemma  12,  this  can  be  done  in 
time  O(logn)  with  n processors.) 


ALGORITHM  SHORT  CYCLE  (D)  has  time  complexity  O(log^n)  and 
processor  complexity  n Tn/lognl.  Notice  that  this  algorithm  solves  the 
problem  of  finding  a cycle  of  G.  The  best  known  serial  algorithms  for 

detecting  a cycle  in  a directed  graph  has  a worst  case  time  complexity 

2 

of  0(n  ) [15,21].  Thus  the  parallel  algorithm  achieves  a speedup  of 
2 2 

0(n  /log  n)  with  an  efficiency  of  0(l/nlogn). 
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Chapter  8 
LOWER  BOUNDS 

2 

In  previous  chapters,  algorithms  of  time  complexity  0(log  n) 
were  found  for  several  graph  problems.  It  will  now  be  shown  that  at 
least  2 log  n + c time  units  are  required  to  solve  some  of  these  problems 
on  the  unbounded  parallel  model,  when  the  adjacency  or  weight  matrix  of 
the  graph  is  used  as  input.  Thus  in  many  cases,  the  algorithms  obtained 
in  this  paper  are  within  a factor  of  log  n of  being  optimal,  as  far  as 
time  complexity  is  concerned. 

The  lower  bounds  are  obtained  in  two  steps.  First,  an  argument 
is  presented  to  show  that  the  solution  to  a given  problem  on  a graph  with 
n vertices  is  a nontrivial  function  of  at  lecst  g(n)  entries  in  the 
adjacency  matrix  of  the  graph,  for  some  function  g.  This  argument, 
whenever  possible,  is  based  on  the  Aanderaa-Rosenberg  Theorem  of 
Section  8.2.  Second,  Theorem  12  of  Section  8.1  is  used  to  show  that  at 
least  riog(g(n))l  time  units  are  required  to  compute  a quantity  which 
is  a function  of  g(n)  input  variables.  In  each  problem  considered, 
g(n)  will  be  such  that  riog(g(n))l  > 2 log  n + c,  for  some  constant  c. 

8. 1 A Fan-In  Theorem 

A binary  tree,  T,  is  a rooted  tree  in  which  each  vertex  has  at 
most  two  sons.  A vertex  of  T which  has  no  sons  is  called  a leaf.  The 
height  of  T is  the  length  of  the  longest  simple  path  in  T between  the 


i 


' 

i 
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root  of  T and  a leaf  of  T.  Note  that  a binary  tree  of  height  h can 
have  at  most  2h  leaves. 

Let  0 be  a set,  and  f a function  defined  on  Dn  for  some  positive 

integer  n.  Let  A be  an  algorithm  which  computes  the  function  f,  given 

(x^ , . . . , xp)  e Dn,  on  the  unbounded  parallel  model  in  t time  units. 

In  computing  f,  A may  compute  several  intermediate  functions.  The 

computation  of  f by  A can  be  represented  by  a binary  computation  tree, 

Ta>  with  height  t,  which  is  constructed  in  the  following  way.  At  level 

t,  place  a single  vertex,  the  root  of  T^,  labeled  f.  Assume  inductively 

that  has  been  constructed  through  level  i where  0 < i < t and  each 

vertex  at  level  i is  labeled  by  a function  of  (x^ , . . .,  xn)  which  has 

been  computed  by  algorithm  A in  at  most  i time  units.  Level  i - 1 of 

is  constructed  as  follows.  If  a vertex  v at  level  i is  labeled  g, 

where  g (x, , . . . , x ) t {x,,  . . .,  x },  then  v is  a leaf  of  T..  Other- 

i n i ti  h 

i 

wise,  there  must  be  functions  h(xp  . . . , xn)  and  h (x^,  . . . , xn) 
computed  by  A in  at  most  i - 1 time  units,  and  a binary  function  F, 
such  that 


i 

g(xi , . . .»Xp)_  F(h(x^ , . . .,  xn ),  h ( x-j , . . .,  xn)). 


In  this  case,  two  vertices  labeled  h and  h are  added  to  level  i - 1 of 

■ 

Ta,  along  with  the  edges  (g,h)  and  (g,h  ). 

A 5 

As  an  example,  let  f (x, , . . xc)  * T x..  Then  f is  com- 

1 b i=l  1 


I 

i 


puted  by  the  algorithm  below. 
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Algorithm  A 

Input:  x^ , . . . , x<. 

1.  Assign  one  processor  to  the  pair  (x-j^)  and  one  processor  to  the 
pair  (x^.Xg)  to  compute  x-|  + X2  and  x^  + x,j. 

2.  Assign  one  processor  to  the  pair  (x-|+X2.x3)  to  compute  (X1+X2)  + x^. 

3.  Assign  one  processor  to  the  pair  ( ( x^  +X2 ) + x^,  x^+x^)  to  compute 

((x1+x2)  + x3)  + (x4+x5)  = f(xj,  . . x5). 

Algorithm  A computes  f in  3 time  units.  A binary  computation 
tree  for  A is  shown  in  Figure  5. 

For  i e {1 , . . . n},  f(x^,  . . xn)  is  a nontrivial  function 
of  xi  if  there  are  elements  (a^,  . . . , an)  and  (b^  . . bn)  of  Dn 
such  that  ai  t b^  but  aj  = b^  for  j t i,  and  f(a-|,  . . an)  t 

f (b-j , . . . 1 bR ) . 

Theorem  12.  Let  f(x^,  ....  xp)  be  a nontrivial  function  of  each  of 
the  variables  x-| , . . . , xn.  Then  any  algorithm  which  computes  f,  given 
x^,  • • . , xn>  on  the  unbounded  parallel  model,  has  time  complexity  at 
least  Tlognl. 

Proof:  Let  A be  any  algorithm  which  computes  f and  let  T^ 
be  the  binary  computation  tree  for  A.  Since  f is  a nontrivial  function 
of  Xj,  for  1 * 1,  ....  n,  there  must  be  a path  in  T^  from  the  root,  f, 
to  a leaf  labeled  x^.  Thus,  TA  must  have  at  least  n leaves.  Since  a 
binary  tree  of  height  h has  at  most  2^  leaves,  the  height  of  Tft  is  at 
least  logn,  and  thus  at  least  Tlognl,  since  the  height  is  an  integer. 
Thus,  A requires  at  least  flognl  time  units  to  compute  f.  QED 


8.2  The  Aanderaa-Rosenberq  Theorem 


Let  Pn  be  a property  of  undirected  graphs  with  n vertices. 

For  example,  Pp  may  be  the  property  "being  connected"  or  "being  planar." 

Pn  is  called  nontrivial  if  there  is  some  graph  with  n vertices  which  has 
property  P^  and  some  graph  with  n vertices  which  does  not  have  property 
P^.  "Being  planar"  is  a nontrivial  property  of  graphs  with  n t 5 vertices. 
Pn  is  monotone  if  whenever  a graph  G = (V,E)  with  n vertices  has  property 

i i • 

P , then  any  graph  G = (V,E  ),  where  E ? E , also  has  property  Pn- 
P is  symmetric  under  vertex  permutation  if  whenever  a graph  G = (V,E) 

I 

has  property  P , and  p : V-*-V  is  a permutation  of  V,  then  the  graph  G = 

l 

(V,E  ) also  has  property  P , where 


E « {(p(x).p(y))  | (x,y)  e E}. 


The  following  theorem  was  conjectured  by  Aanderaa  and  Rosenberg 
[23]  and  has  been  proven  by  Rivest  and  Vuillemin  [22]. 


Theorem  13.  Let  Pn  be  a nontrivial,  monotone  property  of  undirected 

graphs  with  n vertices,  which  is  symmetric  under  vertex  permutation. 

For  an  undirected  graph  G,  with  n vertices,  let  fp  be  a function  of  the 

n 

entries  of  the  adjacency  matrix  of  G which  has  the  value  1,  if  G has 

property  P , and  has  value  0,  otherwise.  Then  fp  is  a nontrivial  func- 
2 n 

tion  of  at  least  cn  entries  of  the  adjacency  matrix  of  G,  for  some 


constant  c. 


The  following  lemma  is  easily  verified. 
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Lemma  19.  The  following  properties  of  undirected  graphs  with  n vertices 
are  nontrivial,  monotone,  and  symmetric  under  vertex  permutation: 

(i)  being  biconnected,  for  n > 2 

(ii)  being  bridge  connected,  for  n > 2,  and 

(iii)  possessing  a cycle,  for  n > 3. 

I 

8.3  Lower  Bound  Results 

1 Using  the  results  of  the  previous  two  sections,  the  following 

lower  bounds  are  obtained. 

Theorem  14.  On  the  unbounded  parallel  model,  at  least  2 log  n + c time 
units,  for  some  constant  c,  are  required  to  determine 

(i)  if  a given  undirected  graph  G,  with  n > 2 vertices, 
is  biconnected, 

(ii)  if  a given  undirected  graph  G,  with  n > 2 vertices,  is 
bridge  connected,  and 

(iii)  if  a given  undirected  graph  G,  with  n > 3 vertices, 
contains  a cycle, 

, . if  the  adjacency  matrix  of  G is  used  as  the  input. 

Proof : (i)  Let  f have  the  value  1,  if  G is  biconnected,  0, 

otherwise.  Any  algorithm  which  determines  if  G is  biconnected,  using  the 

adjacency  matrix  of  G as  input,  computes  f as  a function  of  the  entries 

in  the  adjacency  matrix.  By  Lemma  19  (i)  and  Theorem  13,  f is  a nontri- 

2 

vial  function  of  at  least  kn  input  variables  for  some  constant  k.  Thus, 
by  Theorem  12,  at  least 


I 

I 
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riog(kn^)!  > 2 log  n + log  k 

time  units  are  required  to  compute  f. 

The  proofs  of  (ii)  and  (iii)  are  analogous.  QED 
To  obtain  lower  bounds  on  the  time  required  to  determine  if  a 
directed  graph  contains  a cycle,  to  find  the  dominators  of  a rooted, 
directed  graph,  and  to  find  a minimum  spanning  tree  of  a weighted  graph, 
different  techniques  must  be  used...  The  first  lemma  below  is  due  to  Holt 
and  Reingold  [13]. 

Lemma  20.  Let  G be  a directed  graph  with. n vertices  and  let  f be  a 
function  of  the  entries  in  the  adjacency  matrix  of  G which  has  the  value 
1 if  G contains  a cycle,  0,  otherwise.  Then  f is  a nontrivial  function 

v 

of  at  least  n(n+l)/2  entries  in  the  adjacency  matrix  of  G. 

Proof:  See  [13].  QED  , j 

| 1 

Lemma  21 . Let  (G,r)  be  a rooted  graph,  where  G = ( V, E ) , with  n 2 7 
vertices  and  let  i,j  e V.  Let  f^  ^ be  a function  of  the  entries  in 
the  adjacency  matrix  of  G which  has  the  value  1,  if  i is  a dominator  ; 

of  j in  (G,r)  and  0,  otherwise.  Then  f^  is  a nontrivial  function  of  at  . 

least  (n-l)(n-3)/4  entries  in  the  adjacency  matrix  of  G.  i 

Proof:  The  technique  of  proof  is  to  find  a rooted,  directed 

» 

graph  G in  which  it  is  particularly  hard,  for  some  pair  of  vertices  i,j, 
to  determine  if  i is  a dominator  of  j,  from  the  adjacency  matrix  of  G. 

For  k > 5,  let  G^  = (V,E)  be  the  directed  graph  with 

j 
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V = {r , u i * • • • » ^k»  » • • • * ^k^ 

and 

— > — > > > 

E = { (r  ,u-j ),  (r,  v-j ) jl  U { (u^- ) , ( v,.  ) | i = 1,  . . .,  k-1). 

Then  (G ,r)  is  a rooted  graph  with  n = 2k  + 1 vertices.  Let 
i = v-j  and  j = vfc.  Then  i is  a dominator  of  j in  (Gk,r)  thus  = 1. 

i * 

Let  U * (u-|,  . . .,  uk>  and  V = {v2 vk>.  Let  A be  the  adjacency 

matrix  of  Gk<  Claim:  f^  j is  a nontrivial  function  of  A(x,y)  for  each 
pair  (x,y)  e U x V . To  see  this,  note  that  for  (x,y)  e U x V , A(x,y)  = 

I • 

0.  Let  A be  the  n x n matrix  where  for  (u,'')  e V x V, 


A (u »v) 


if  (u,v)  = (x,y) 

A(u,v),  otherwise. 


Since  i is  a dominator  of  j in  (Gk,r)  but  not  in  the  graph  with  adjacency  ma- 

1 1 I 

trix  A , f^.  j)(A)  t f^  j j (A  ),  proving  the  claim.  Since  |u  | = (n-l)/2 
and  | V | = (n-3)/2,  f^  ^ is  a nontrivial  function  of  at  least  (n-l)(n-3)/4 
entries  in  the  adjacency  matrix. 


Lemma  22.  Let  G be  a connected,  undirected,  weighted  graph  with  n > 3 

vertices.  Let  (x,y)  be  an  edge  of  G and  let  f^x  j be  a function  of  the 

weight  matrix  of  G which  has  the  value  1,  if  (x,y)  is  in  a minimum 

spanning  tree  of  G,  and  0,  otherwise.  Then  f^x  ^ is  a nontrivial  func- 
o 

tion  of  at  least  (n  -5)/4  entries  in  the  weight  matrix  of  G. 
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Proof:  The  technique  of  this  proof  is  similar  to  that  of  the 

previ"*"  lemma.  For  n > 3,  let  = (V,E)  be  the  complete  graph  on  n 

vertices  and  let  (x,y)  e E.  Let  r = in/2 J and  t = Tn/21.  Partition  V 

into  two  disjoint  sets,  V = {u, , . . .,  u } and  V = {v. , . . . , v }, 

xi  i y • t 

of  r and  t elements,  respectively,  such  that  x e V and  y e V . For 

* y 

z = x,y,  let  be  the  set  of  edges  of  a spanning  tree  of  the  complete 
subgraph  of  Kn  on  the  vertices  of  Vz*  Assign  weights  to  the  edges  of  Kn 
as  follows.  For  e e E, 

, if  e e SxUSy 

, if  e = (x,y) 

, otherwise. 

Then  S = S1  U U {(x,y)}  is  the  set  of  edges  of  a MST  for  Kn  with 
weight  function  w and,  in  fact,  is  unique.  Note  that  f^x  j = 1 since 
(x,y)  is  in  a MST  of  Kn- 

Let  W be  the  weight  matrix  of  Kn  with  weight  function  w. 

Claim:  For  each  (u,v)  e x Sy,  such  that  (u,v)  ? (x,y),  f^x  ^ is  a 

nontrivial  function  of  W(u,v).  To  see  this,  let  (u  ,v  ) e S x S , 

x y 

i i i 

(u  ,v  ) f (x,y),  and  let  W be  the  n x n matrix  defined  by 

6 , if  (u  ,v  ) * (u,v) 

iii  \ 

W (u  ,v  ) - S 

I W(u  ,v  ),  otherwise. 

i * 

If  G is  the  graph  whose  weight  matrix  is  W , (x,y)  is  not  an  edge  of  a 

I I 

MST  of  G . Thus,  f^x  y)(W)  t f(x  y)(W  ),  proving  the  claim.  Since 
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|SJ  = Ln/2 J and  |Sy|  » Tn/21,  |(SxxSy)  - {(x,y)}  | = Ln/2J  Tn/21-1  * 

2 2 
(n  -5)/4.  Thus,  f^x  is  a nontrivial  function  of  at  least  (n  -5)/4 

entries  in  the  weight  matrix.  QED 

Lemmas  20  through  22  are  combined  with  Theorem  12  of  Section 

8.1  to  give  the  following  lower  bound  results. 

Theorem  15.  On  an  unbounded  parallel  model,  at  least  21ogn  - c time 
units,  for  some  constant  c,  are  required  to  determine: 

(i)  for  a given  directed  graph  G with  n vertices,  whether  G 
contains  a cycle,  using  the  adjacency  matrix  of  G as  input, 

(ii)  for  a given  rooted  graph  (G,r),  where  G = (V,E), 

with  n > 5 vertices  and  for  i ,j  e V,  whether  i is  a dominator  of  j in  (G,r) 
using  the  adjacency  matrix  of  G as  input,  and 

(iii)  for  a given  connected,  undirected,  weighted  graph  G, 
with  n > 3 vertices,  the  weight  matrix  of  a MST  of  G,  using  the 
weight  matrix  of  G as  input. 

Proof:  (i)  Any  algorithm,  which  determines  whether  or  not  G 
contains  a cycle,  computes  a function  f,  where  f = 1 if  G contains  a 
cycle,  and  f = 0,  otherwise.  By  Lemma  20,  f is  a nontrivial  function  of 
n(n+l)/2  Input  variables.  Thus  by  Theorem  12,  at  least  Tl og (n ( n+1 )/2)l 
time  units  are  required  to  compute  f from  the  adjacency  matrix  of  G.  The 
theorem  follows  by  noting  that 

riog(n(n+2)/2)l  i logn  + log(n+l)  - 2 > 2 logn  - 2. 


(ii)  This  follows  as  in  (i)  with  the  observation  that 
riog(n-l )(n-3)/4)l  l log(n-l)  + log(n-3)  - 2 > 2 logn 


for  n > 5. 

(iii)  This  follows  as  in  (i)  by  observing  that 
riog(n2-5)/4l  > log(n2-5)  - 2 > 2 logn  - 4. 


for  n > 3. 


Chapter  9 


CONCLUSION 

It  has  been  demonstrated  in  this  paper  that  in  solving  certain 

graph  problems,  operations  can  be  performed  in  parallel  to  substantially 

reduce  the  computation  time.  Using  the  unbounded  parallel  model  of  com- 

2 

putation  of  Section  1.1,  parallel  algorithms  of  time  complexity  0( 1 og  n) 
have  been  developed  to  solve  the  problems  of  finding  minimum  spanning 
trees,  spanning  trees,  bi connected  components,  dominators,  bridges, 
cr  dge  connected  components,  cycles,  cycle  bases,  and  shortest  cycles. 

In  contrast,  the  best  sequential  algorithms  for  all  of  these  problems, 
except  finding  a cycle  basis,  have  a worst  case  time  complexity  of 
0(n  ).  The  best  sequential  algorithm  for  finding  a cycle  basis  has  time 
complexity  0(n  ). 

When  the  number  of  processors,  P(n),  used  by  a parallel  algorithm 
to  achieve  a time  complexity,  T(n),  is  considered,  it  is  possible  that 
much  waste  is  involved,  that  is,  that  P(n)*T(n)  is  much  greater  than  the 
time  complexity  of  the  best  serial  algorithm.  It  has  been  shown  that 
the  algorithms  presented  here  for  finding  minimum  spanning  trees,  bridges, 
and  cycle  bases  are  not  only  "fast,"  but  involve  relatively  little  waste 
as  well. 

In  Chapter  8,  lower  bounds  of  2T1 ognl  + c were  obtained  for 
some  of  the  graph  problems  listed  above.  Thus,  several  algorithms  pre- 
sented in  this  paper  have  time  complexities  which  are  optimal,  at  least 
to  within  a factor  of  logn. 


~~~  *1 
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The  results  in  this  paper  indicate  that  graph  theoretic  problems 
are  capable  of  being  solved  efficiently  on  parallel  computers.  However, 
the  design  of  a specific  computer  will  severely  influence  the  time  re- 
quired to  solve  a given  problem.  An  analysis  of  the  techniques  used  in 
the  algorithms  which  have  been  presented  here  may  produce  ideas  as  to 
what  design  characteristics  may  be  desirable  in  a parallel  computer  which 
must  solve  a large  number  of  graph  problems. 
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